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HYBRID ADENOVIRUS-AAV VIRUS AND METHODS OF USE THEREOF 

r 

This invention was supported by the National 
Institute of Health Grant No. P30 DK 47757. The United 
5 States government has rights in this invention. 

Field of the Invention 

The present invention relates to the field of 
vectors useful in somatic gene therapy and the production 
10 thereof. 

Background of the Invention 

Recombinant adenoviruses are capable of 
providing extremely high levels of transgene delivery to 

15 virtually all cell types, regardless of the mitotic 
state. High titers (lo 13 plague forming units/ml) of 
recombinant virus can be easily generated in 293 cells 
(the adenovirus equivalent to retrovirus packaging cell 
lines) and cryo-stored for extended periods without 

20 appreciable losses. 

The primary limitation of this virus as a 
vector resides in the complexity of the adenovirus 
genome. A human adenovirus is comprised of a linear, 
approximately 36 kb double-stranded DNA genome, which is 
25 divided into 100 map units (m.u.), each of which is 360 
bp in length. The DNA contains short inverted terminal 
repeats (ITR) at each end of the genome that are required 
for viral DNA replication. The gene products are 
organized into early (El through E4) and late (LI through 
L5) regions, based on expression before or after the 
initiation of viral DNA synthesis [see, e.g., Horwitz, 
Virology , 2d edit., ed. B. N. Fields, Raven Press, Ltd. , 
New York (1990) ] . 

A human adenovirus undergoes a highly regulated 
35 program during its normal viral life cycle [Y. Yang et 
a1 ' Proc. Natl. Acad . Sci.. TTfiA r 91:4407-4411 (1994)]. 
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Virions are internalized by receptor-mediated endocytosis 
and transported to the nucleus where the immediate early 
genes, Ela and Elb, are expressed. Because these early 
gene products regulate expression of a variety of host 
5 genes (which prime the cell for ^irus production) and are 
central to the cascade activation of early delayed genes 
(e.g. E2, E3, and E4) followed by late genes (e.g. Ll-5) , 
first generation recombinant adenoviruses for gene 
therapy focused on the removal of the El domain. This 

10 strategy was successful in rendering the vectors 

replication defective, however, in vivo studies revealed 
transgene expression was transient and invariably 
associated with the development of severe inflammation at 
the site of vector targeting [S. Ishibashi et al, 

15 Plin. Invest .. 93:1885-1893 (1994); J. M. Wilson et al, 
Prnn . Natl. Sci. . USA . 85:4421-4424 (1988); J. M. 

Wilson et al, clin. Bio. . 3:21-26 (1991); M. Grossman et 

al. fin™. Cell, and Mol. Gen. . 12:601-607 (1991)]. 

Adeno-associated viruses (AAV) have also been 
employed as vectors. AAV is a small, single-stranded 
(ss) DNA virus with a simple genomic organization (4.7 
kb) that makes it an ideal substrate for genetic 
engineering. Two open reading frames encode a series of 
rep and cap polypeptides. J?ep polypeptides (rep78, 
25 rep68, rep62 and rep40) are involved in replication, 
rescue and integration of the AAV genome. The cap 
proteins (VP1, VP2 and VP3) form the virion capsid. 
Flanking the rep and cap open reading frames at the 5' 
and 3' ends are 145 bp inverted terminal repeats (ITRs) , 
the first 125 bp of which are capable of forming Y- or T- 
shaped duplex structures. Of importance for the 
development of AAV vectors, the entire rep and cap 
domains can be excised and replaced with a therapeutic or 
reporter transgene [B. J. Carter, in "Handbook of 
35 Parvoviruses", d., P. Tijsser, CRC Press, pp. 155-168 



20 
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(1990)]. It has been shown that the ITRs represent the 
minimal sequence required for replication, rescue, 
packaging, and integration of the AAV genome. 

The AAV life cycle is biphasic, composed of 
5 both latent and lytic episodes. During a latent 

infection, AAV virions enter a cell as an encapsidated 
ssDNA, and shortly thereafter are delivered to the 
nucleus where the AAV DNA stably integrates into a host 
chromosome without the apparent need for host cell 

10 division. In the absence of helper virus, the integrated 
ss DNA AAV genome remains latent but capable of being 
activated and rescued. The lytic phase of the life cycle 
begins when a cell harboring an AAV provirus is 
challenged with a secondary infection by a herpesvirus or 

15 adenovirus which encodes helper functions that are 
recruited by AAV to aid in its excision from host 
chromatin [B. J. Carter, cited above]. The infecting 
parental ssDNA is expanded to duplex replicating form 
(RF) DNAs in a rep dependent manner. The rescued AAV 

20 genomes are packaged into preformed protein capsids 

(icosahedral symmetry approximately 20 nm in diameter) 
and released as infectious virions that have packaged 
either + or - ss DNA genomes following cell lysis. 

Progress towards establishing AAV as a 

25 transducing vector for gene therapy has been slow for a 
variety of reasons. While the ability of AAV to 
integrate in quiescent cells is important in terms of 
long term expression of a potential transducing gene, the 
tendency of the integrated provirus to preferentially 

30 target only specific sites in chromosome 19 reduces its 
usefulness. Additionally, difficulties surround large- 
scale production of replication defective recombinants. 
In contrast to the production of recombinant retrovirus 
or adenovirus, the only widely recognized means for 

35 manufacturing transducing AAV virions entails co- 
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transfection with two different, yet complementing 
plasmids. One of these contains the therapeutic or 
reporter minigene sandwiched between the two cis acting 
AAV ITRs. The AAV components that are needed for rescue 
5 and subsequent packaging of progeny recombinant genomes 
are provided in trans by a second plasmid encoding the 
viral open reading frames for rep and cap proteins. The 
cells targeted for transfection must also be 
with adenovirus thus providing the necessary helper 

10 functions. Because the yield of recombinant AAV is 

dependent on the number of cells that are transfected 
with the cis and trans-acting plasmids, it is desirable 
to use a transfection protocol with high efficiency. For 
large-scale production of high titer virus, however, 

15 previously employed high efficiency calcium phosphate and 
liposome systems are cumbersome and subject to 
inconsistencies . 

There remains a need in the art for the 
development of vectors which overcome the disadvantages 

20 of the known vector systems. 

summar y of the Invention 

In one aspect, the present invention provides a 
unique recombinant hybrid adenovirus /AAV virus, which 

25 comprises an adenovirus capsid containing selected 

portions of an adenovirus sequence, 5' and 3' AAV ITR 
sequences which flank a selected transgene under the 
control of a selected promoter and other conventional 
vector regulatory components. This hybrid virus is 

30 characterized by high titer transgene delivery to a host 
cell and the ability to stably integrate the transgene 
into the host cell chromosome in the presence of the rep 
gene. In one embodiment, the transgene is a reporter 
gene. Another embodiment of the hybrid virus contains a 

35 therapeutic transgene. In a preferred embodiment, the 
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hybrid virus has associated therewith a polycation 
sequence and the AAV rep gene. This construct is termed 
the hybrid virus conjugate or trans-infection particle. 

In another aspect, the present invention 
5 provides a hybrid vector construct for use in producing 
the hybrid virus or viral particle described above. This 
hybrid vector comprises selected portions of an 
adenovirus sequence, 5' and 3 1 AAV ITR sequences which 
flank a selected transgene under the control of a 

10 selected promoter and other conventional vector 
regulatory components. 

In another aspect, the invention provides a 
composition comprising a hybrid viral particle for use in 
delivering a selected gene to a host cell. Such a 

15 composition may be employed to deliver a therapeutic gene 
to a targeted host cell to treat or correct a genetically 
associated disorder or disease. 

In yet another aspect, the present invention 
provides a method for producing the hybrid virus by 

20 transfecting a suitable packaging cell line with the 

hybrid vector construct of this invention. In another 
embodiment the method involves co-transf ecting a cell 
line (either a packaging cell line or a non-packaging 
cell line) with a hybrid vector construct and a suitable 

25 helper virus. 

In a further aspect, the present invention 
provides a method for producing large quantities of 
recombinant AAV particles with high efficiency by 
employing the above methods, employing the hybrid vector 
30 construct of this invention and collecting the rAAV 

particles from a packaging cell line transfected with the 
vector . 

Other aspects and advantages of the present 
invention are described further in the following detailed 
35 description of the preferred embodiments thereof. 
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RT-ief Description of tihe Drawings 

Fig. 1A is a schematic diagram of a vector 
construct p Ad. AV.CMVLa cZ [SEQ ID NO: 1], which contains 
(from the top in clockwise order) adenovirus sequence map 
units 0-1 (clear bar); the 5' AAV ITR (solid bar); a CMV 
immediate early enhancer /promoter (hatched arrow) , an 
SV40 intron (clear bar) , an E. coli beta-galactosidase 
cDNA (LacZ) (hatched line) , an SV40 polyadenylation 
signal (clear bar), a 3 • AAV ITR (solid bar), adenovirus 
sequence from map units 9-16 (clear bar) , and a portion 
of a pBR322 derivative plasmid (thin solid line) . 
Restriction endonuclease enzymes are identified by their 
conventional designations; and the location of each 
restriction enzyme is identification by the nucleotide 
15 number in parentheses to the right of the enzyme 

designation. 

Fig. IB is a schematic drawing demonstrating 

linearization of pAd . AV . CMVLacZ [SEQ ID NO: 1] by 
digestion with restriction enzyme Nhel and a linear 
20 arrangement of a Clal digested adenovirus type 5 with 
deletions from mu 0-1. The area where homologous 
recombination will occur (between m.u. 9-16) in both the 
plasmid and adenovirus sequences is indicated by crossed 

lines. 

25 Fig. 1C is a schematic drawing which 

demonstrates the hybrid virus Ad . AV . CMVLacZ after co- 
transfection of the linearized pAd . AV . CMVLacZ [SEQ ID NO: 
1] and adenovirus into 293 cells followed by 
intracellular homologous recombination. 

30 Fig. 2A-2K report the top DNA strand of the 

double-strand plasmid pAd . AV . CMVLacZ [SEQ ID NO: 1] (the 
complementary strand can be readily derived by one of 
skill in the art) . With reference to SEQ ID NO: 1, 
nucleotides 1-365 are adenovirus type 5 sequences; the 5' 

35 AAV ITR sequence spans nucleotides 366-538; the CMV 
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promoter/ enhancer spans nucleotides 563-1157; the SV-40 
intron spans nucleotides 1158-1179; the LacZ gene spans 
nucleotides 1356-4827; the SV-40 poly A sequence spans 
nucleotides 4839-5037; the 3' AAV ITR spans nucleotides 
5 5053 to 5221; nucleotides 5221 vo about 8100 are 

adenovirus type 5 sequences. The remaining sequences are 
non-specif ic/plasmid sequences. 

Fig. 3 is a bar graph plotting u.v. absorbance 
at 420 nm of the beta-galactosidase blue color for a 
10 control and ten putative positive clones (D1A through 

D1J) of 293 cells transfected with the recombinant hybrid 
Ad.AV.CMVLacZ. Eight of the clones expressed high levels 
of enzyme. 

Fig. 4 is a schematic diagram of pRep78/52 [SEQ 
15 ID NO: 2]. This plasmid includes an AAV P5 promoter, 
Rep78, Rep52 and a poly-A sequence in a pUC18 plasmid 
background. 

Figs. 5A - 5E report nucleotides 1-4910 of the 
top DNA strand of the double-strand plasmid pRep78/52 
20 [SEQ ID NO: 2] (the complementary strand can be readily 
derived by one of skill in the art) . 

Fig. 6 is a flow diagram of the construction of 
a trans-infection particle formed by a hybrid virus , a 
poly-L-lysine sequence and attached AAV rep-containing 
25 plasmid. 

Fig. 7 is a flow diagram of the hybrid virus 1 
life cycle , in which a trans-infection particle enters 
the cell and is transported to the nucleus. The virus is 
uncoated and the rep mediates rescue of the inserted 
30 gene, which is then integrated into the chromosome of the 
host cell. 
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nrtailed Desc ri ption of the Invention 

The present invention provides a unique gene 
transfer vehicle which overcomes many of the limitations 
of prior art viral vectors. This engineered hybrid virus 
5 contains selected adenovirus domains and selected AAV 
domains as well as a selected transgene and regulatory 
elements in a viral capsid. This novel hybrid virus 
solves the problems observed with other, conventional 
gene therapy viruses, because it is characterized by the 
10 ability to provide extremely high levels of transgene 
delivery to virtually all cell types (conferred by its 
adenovirus sequence) and the ability to provide stable 
long-term transgene integration into the host cell 
(conferred by its AAV sequences) . The adenovirus-AAV 
15 hybrid virus of this invention has utility both as a 

novel gene transfer vehicle and as a reagent in a method 
for large-scale recombinant AAV production. 

In a preferred embodiment, a trans-infection 
particle or hybrid virus conjugate composed of the hybrid 
20 Ad/ AAV virus conjugated to a rep expression plasmid via a 
poly- lysine bridge is provided. This trans- infection 
particle is advantageous because the adenovirus carrier 
can be grown to titers sufficient for high MOI infections 
of a large number of cells, the adenoviral genome is 
25 efficiently transported to the nucleus in nondividing 
cells as a complex facilitating transduction into 
mitotically quiescent cells, and incorporation of the rep 
plasmid into the trans-infection particle provides high 
but transient expression of rep that is necessary for 
30 both rescue of rAAV DNA and efficient and site-specific 
integration. 
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Construction of the Hybrid Vector and Virus 

A - The Adenovirus Component of the Vector and 

Virus 

The hybrid virus of this invention uses 
5 adenovirus nucleic acid sequences as a shuttle to deliver 
a recombinant AAV/transgene genome to a target cell. The 
DNA sequences of a number of adenovirus types , including 
type Ad5, are available from Genbank. The adenovirus 
sequences may be obtained from any known adenovirus type, 

10 including the presently identified 41 human types 

[Horwitz et al, cited above]. Similarly adenoviruses 
known to infect other animals may also be employed in the 
vector constructs of this invention. The selection of 
the adenovirus type is not anticipated to limit the 

15 following invention. A variety of adenovirus strains are 
available from the American Type Culture Collection, 
Rockville, Maryland, or available by request from a 
variety of commercial and institutional sources. In the 
following exemplary embodiment an adenovirus, type 5 

20 (Ad5) is used for convenience. 

The adenovirus nucleic acid sequences 
employed in the hybrid vector of this invention can range 
from a minimum sequence amount, which requires the use of 
a helper virus to produce the hybrid virus particle, to 

25 only selected deletions of adenovirus genes, which 

deleted gene products can be supplied in the hybrid viral 
production process by a selected packaging cell. 
Specifically, at a minimum, the adenovirus nucleic acid 
sequences employed in the pAdA shuttle vector of this 

30 invention are adenovirus genomic sequences from which all 
viral genes are deleted and which contain only those 
adenovirus sequences required for packaging adenoviral 
genomic DNA into a preformed capsid head. More 
specifically, the adenovirus sequences employed are the 

35 cis-acting 5 1 and 3' inverted terminal repeat (ITR) 
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sequences of an adenovirus (which function as origins of 
replication) and the native 5' packaging/ enhancer domain, 
that contains sequences necessary for packaging linear Ad 
genomes and enhancer elements for the El promoter. 
5 According to this invention, the entire adenovirus 5' 
sequence containing the 5' ITR and packaging/ enhancer 
region can be employed as the 5' adenovirus sequence in 
the hybrid virus. This left terminal (5') sequence of 
the Ad5 genome useful in this invention spans bp 1 to 
10 about 360 of the conventional adenovirus genome, also 
referred to as map units 0-1 of the viral genome, and 
generally is from about 353 to about 360 nucleotides in 
length. This sequence includes the 5' ITR (bp 1-103 of 
the adenovirus genome); and the packaging/ enhancer domain 
15 (bp 194-358 of the adenovirus genome) . Preferably, this 
native adenovirus 5 1 region is employed in the hybrid 
virus and vector in unmodified form. Alternatively, 
corresponding sequences from other adenovirus types may 
be substituted. These Ad sequences may be modified to 
20 contain desired deletions, substitutions, or mutations, 
provided that the desired function is not eliminated. 

The 3' adenovirus sequences of the hybrid virus 
include the right terminal (3«) ITR sequence of the 
adenoviral genome spanning about bp 35,353 - end of the 
25 adenovirus genome, or map units -98.4-100. This sequence 
is generally about 580 nucleotide in length. This entire 
sequence is desirably employed as the 3' sequence of a 
hybrid virus. Preferably, the native adenovirus 3 1 
region is employed in the hybrid virus in unmodified 
30 form. However, as described above with respect to the 5' 
sequences, some modifications to th se s quences which do 
not adversely ffect their biological function may be 
acceptable. As described below, when these 5' and 3' 
adenovirus sequ nces are employ d in the hybrid vector, a 
35 helper adenovirus which supplies all other essential 
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genes for viral formation alone or with a packaging cell 
line is required in the production of the hybrid virus or 
viral particle. 

Alternative embodiments of the hybrid 
5 virus employ adenovirus sequences in addition to the 

minimum sequences, but which contain deletions of all or 
portions of adenovirus genes. For example, the 
adenovirus immediate early gene Ela (which spans mu 1.3 
to 4.5) and delayed early gene Elb (which spans mu 4.6 to 

10 11.2) should be deleted from the adenovirus sequence 
which forms a part of the hybrid vector construct and 
virus. Alternatively, if these sequences are not 
completely eliminated, at least a sufficient portion of 
the Ela and Elb sequences must be deleted so as to render 

15 the virus replication defective. These deletions, 
whether complete or partial, which eliminate the 
biological function of the gene are termed "functional 
deletions" herein. 

Additionally, all or a portion of the 

20 adenovirus delayed early gene E3 (which spans mu 76.6 to 
86.2) may be eliminated from the adenovirus sequence 
which forms a part of the hybrid virus. The function of 
E3 is irrelevant to the function and production of the 
hybrid virus. 

25 All or a portion of the adenovirus delayed 

early gene E2a (which spans mu 67.9 to 61.5) may be 
eliminated from the hybrid virus. It is also anticipated 
that portions of the other delayed early genes E2b (which 
spans mu 29 to 14.2) and E4 (which spans mu 96.8 to 91.3) 

30 may also be eliminated from the hybrid virus and from the 
vector . 

Deletions may also be made in any of the 
late genes LI through L5, which span mu 16.45 to 99 of 
the adenovirus genome. Similarly, deletions may be 
35 useful in the intermediate genes IX which maps between mu 
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9.8 and 11.2 and IVa 2 which maps between 16.1 to 11.1. 
Other deletions may occur in the other structural or non- 
structural adenovirus. 

The above discussed deletions may occur 
5 individually, i.e., an adenovirus sequence for use in the 
present invention may contain deletions of El only. 
Alternatively, deletions of entire genes or portions 
effective to destroy their biological activity may occur 
in any combination. For example, in one exemplary hybrid 

10 vector, the adenovirus sequence may contain deletions of 
the El genes and the E3 gene, or of the El, E2a and E3 
genes, or of the El and E4 genes, or of El, E2a and E4 
genes, with or without deletion of E3, and so on. 

The more deletions in the adenovirus 

15 sequence up to the minimum sequences identified above 
that characterize the hybrid virus, the larger the 
sequence (s) of the other below-described components to be 
inserted in the hybrid vector. As described above for 
the minimum adenovirus sequences, those gene sequences 

20 not present in the adenovirus portion of the hybrid virus 
must be supplied by either a packaging cell line and/or a 
helper adenovirus to generate the hybrid virus. 

In an exemplary hybrid virus of this invention 
which is described below and in Example 1, the adenovirus 

25 genomic sequences present are from mu 0 to 1, mu 9 to 
78.3 and mu 86 to 100 (deleted sequences eliminate the 
Ela and Elb genes and a portion of the E3 gene) . From 
the foregoing information, it is expected that one of 
skill in the art may construct hybrid vectors and viruses 

30 containing more or less of the adenovirus gene sequence. 

The portions of the adenovirus genome in 
the hybrid virus permit high production titers of the 
virus to be produced, often greater than lxlO 13 pfu/ml. 
This is in stark contrast to the low titers (lxlO 6 

35 pfu/ml) that have been found for recombinant AAV. 
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B. The AAV Components of the Vector and Virus 
Also part of the hybrid vectors and 
viruses of this invention are sequences of an adeno- 
associated virus. The AAV sequences useful in the hybrid 
5 vector are the viral sequences Vrom which the rep and cap 
polypeptide encoding sequences are deleted. More 
specifically, the AAV sequences employed are the cis- 
acting 5 f and 3 1 inverted terminal repeat (ITR) sequences 
[See, e.g., B. J. Carter, in "Handbook of Parvoviruses", 

10 ed., P. Tijsser, CRC Press, pp. 155-168 (1990)]. As 
stated above, the ITR sequences are about 143 bp in 
length. Substantially the entire sequences encoding the 
ITRs are used in the vectors, although some degree of 
minor modification of these sequences is expected to be 

15 permissible for this use. See, e.g., WO 93/24641, 

published December 9, 1993. The ability to modify these 
ITR sequences is within the skill of the art. For 
suitable techniques, see, e.g., texts such as Sambrook et 
al, "Molecular Cloning. A Laboratory Manual.", 2d edit., 

20 Cold Spring Harbor Laboratory, New York (1989). 

The AAV ITR sequences may be obtained from 
any known AAV, including presently identified human AAV 
types. Similarly, AAVs known to infect other animals may 
also be employed in the vector constructs of this 

25 invention. The selection of the AAV is not anticipated 
to limit the following invention. A variety of AAV 
strains, types 1-4, are available from the American Type 
Culture Collection or available by request from a variety 
of commercial and institutional sources. In the 

30 following exemplary embodiment an AAV- 2 is used for 
conveni nee. 

In the hybrid vector construct, the AAV 
sequences are flanked by the selected adenovirus 
sequences discussed above. The 5' and 3 1 AAV ITR 
35 sequences themselves flank a selected transgene sequence 
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and associated regulatory elements, described below. 
Thus, the sequence formed by the transgene and flanking 
5' and 3 1 AAV sequences may be inserted at any deletion 
site in the adenovirus sequences of the vector. For 
5 example, the AAV sequences are desirably inserted at the 
site of the deleted Ela/Elb genes of the adenovirus, 
i.e., after map unit 1. Alternatively, the AAV sequences 
may be inserted at an E3 deletion, E2a deletion, and so 
on. If only the adenovirus 5' ITR/ packaging sequences 
10 and 3 1 ITR sequences are used in the hybrid virus, the 
AAV sequences are inserted between them. 

C. The Transaene Component of th e Hybrid 

Vector and Virus 

The transgene sequence of the vector and 

15 recombinant virus is a nucleic acid sequence or reverse 
transcript thereof, heterologous to the adenovirus 
sequence, which encodes a polypeptide or protein of 
interest. The transgene is operatively linked to 
regulatory components in a manner which permits transgene 

20 transcription. 

The composition of the transgene sequence 
will depend upon the use to which the resulting hybrid 
vector will be put. For example, one type of transgene 
sequence includes a reporter sequence, which upon 

25 expression produces a detectable signal. Such reporter 
sequences include without limitation an E. coli beta- 
galactosidase (LacZ) cDNA, an alkaline phosphatase gene 
and a green fluorescent protein gene. These sequences, 
when associated with regulatory elements which drive 

30 their expression, provide signals detectable by 
conventional means, e.g., ultraviolet wavelength 
absorbance, visible color change, etc. 

Another type of transgene sequence 
includes a therapeutic gene which expresses a desired 

35 gene product in a host cell. These therapeutic gen s or 
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nucleic acid sequences typically encode products for 
.administration and expression in a patient in vivo or ex 
vivo to replace or correct an inherited or non-inherited 
genetic defect or treat an epigenetic disorder or 
5 disease. Such therapeutic gene&> which are desirable for 
the performance of gene therapy include, without 
limitation, a normal cystic fibrosis transmembrane 
regulator (CFTR) gene, a low density lipoprotein (LDL) 
gene, and a number of genes which may be readily selected 

10 by one of skill in the art. The selection of the 

transgene is not considered to be a limitation of this 
invention, as such selection is within the knowledge of 
those skilled in the art. 

D. Regulatory Elements of the Hybrid Vector 

15 In addition to the major elements 

identified above for the hybrid vector, i.e., the 
adenovirus sequences, AAV sequences and the transgene, 
the vector also includes conventional regulatory elements 
necessary to drive expression of the transgene in a cell 

20 transfected with the hybrid vector. Thus the vector 
contains a selected promoter which is linked to the 
transgene and located, with the transgene, between the 
AAV ITR sequences of the vector. 

Selection of the promoter is a routine 

25 matter and is not a limitation of the hybrid vector 

itself. Useful promoters may be constitutive promoters 
or regulated (inducible) promoters, which will enable 
control of the amount of the transgene to be expressed. 
For example, a desirable promoter is that of the 

30 cytomegalovirus immediate early promoter /enhancer [see, 
e.g., Boshart et al, Cell , 41:521-530 (1985)]. Other 
desirable promoters include, without limitation, the Rous 
sarcoma virus LTR promoter /enhancer and the chicken p- 
actin promoter. Still other promoter /enhancer sequences 

35 may be selected by one of skill in the art. 
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The vectors will also desirably contain 
nucleic acid sequences heterologous to the adenovirus 
sequences including sequences providing signals required 
for efficient polyadenylation of the transcript and 
5 introns with functional splice dunor and acceptor sites. 
A common poly-A sequence which is employed in the 
exemplary vectors of this invention is that derived from 
the papovavirus SV-40. The poly-A sequence generally is 
inserted in the vector following the transgene sequences 

10 and before the 3 ' AAV ITR sequence . A common intron 

sequence is also derived from SV-40, and is referred to 
as the SV-40 T intron sequence. A hybrid vector of the 
present invention may also contain such an intron, 
desirably located between the promoter /enhancer sequence 

15 and the transgene. Selection of these and other common 
vector elements are conventional and many such sequences 
are available [see, e.g., Sambrook et al, and references 
cited therein] . The DNA sequences encoding such 
regulatory regions are provided- in the plasmid sequence 

20 of Fig. 2 [SEQ ID NO: 1]. 

The combination of the transgene, 
promoter /enhancer, the other regulatory vector elements 
and the flanking 5' and 3' AAV ITRs are referred to as a 
"minigene" for ease of reference herein. As above 

25 stated, the minigene is located in the site of any 

selected adenovirus deletion in the hybrid virus. The 
size of this minigene depends upon the amount and number 
of adenovirus sequence deletions referred to above. Such 
a minigene may be about 8 kb in size in the exemplary 

30 virus deleted in the El and E3 genes, as described in the 
examples below. Alternatively, if only the minimum 
adenovirus sequences are employed in the virus, this 
minigene may be a size up to about 30 kb. Thus, this 
hybrid vector and vector permit a great deal of latitude 

35 in the selection of the various components of the 
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minigene, particularly the transgene, with regard to 
size. Provided with the teachings of this invention, the 
design of such a minigene can be made by resort to 
conventional techniques. 
5 E. Hybrid Vector Assembly and Production of 

Hybrid Vixrus 

The material from which the sequences used 
in the hybrid vector, helper viruses, if needed, and 
recombinant hybrid virus (or viral particle) are derived 

10 and the various vector components and sequences employed 
in the construction of the hybrid vectors of this 
invention are obtained from commercial or academic 
sources based on previously published and described 
materials. These materials may also be obtained from an 

15 individual patient or generated and selected using 

standard recombinant molecular cloning techniques known 
and practiced by those skilled in the art. Any 
modification of existing nucleic acid sequences forming 
the vectors and viruses, including sequence deletions, 

20 insertions, and other mutations are also generated using 
standard techniques. 

Assembly of the selected DNA sequences of 
the adenovirus, the AAV and the reporter genes or 
therapeutic genes and other vector elements into the 

25 hybrid vector and the use of the hybrid vector to produce 
a hybrid virus utilize conventional techniques, such as 
described in Example 1. Such techniques include 
conventional cloning techniques of cDNA such as those 
described in texts [Sambrook et al, cited above], use of 

30 overlapping oligonucleotide sequences of the adenovirus 
and AAV genomes, polymerase chain reaction, and any 
suitable method which provides th desired nucleotide 
sequence. Standard transfection and co-transf ection 
techniques are employed, e.g., CaP0 4 transfection 

35 techniques using the complementation human embryonic 
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kidney (HEK) 293 cell line (a human kidney cell line .. 
containing a functional adenovirus Ela gene which 
provides a transacting Ela protein) . Other conventional 
methods employed in this invention include homologous 
5 recombination of the viral genomos, plaguing of viruses 
in agar overlay, methods of measuring signal generation, 
and the like. 

As described in detail in Example 1 below 
and with resort to Fig. 1, a unique hybrid virus of this 

10 invention is prepared which contains an El-deleted, 

partially E3 deleted, adenovirus sequence associated with 
a single copy of a recombinant AAV having deletions of 
its rep and cap genes and encoding a selected reporter 
transgene. Briefly, this exemplary hybrid virus was 

15 designed such that the AV.CMVLacZ sequence [SEQ ID NO: 1] 
(a minigene containing a 5 'AAV ITR, a CMV promoter, an 
SV-40 intron, a LacZ transgene, an SV-40 poly-A sequence 
and a 3' AAV ITR) was positioned in place of the 
adenovirus type 5 (Ad5) Ela/Elb genes, making the 

20 adenovirus vector replication defective. 

Because of the limited amount of 
adenovirus sequence present in the hybrid vectors of this 
invention, including the pAV.CMVXacZ [SEQ ID NO: 1] 
above, a packaging cell line or a helper adenovirus or 

25 both may be necessary to provide sufficient adenovirus 

gene sequences necessary for a productive viral infection 

to generate the hybrid virus. 

Helper viruses useful in this invention 
contain selected adenovirus gene sequences not present in 
30 the hybrid vector construct or xpressed by the cell line 
in which the hybrid vector is transfected. Optionally, 
such a helper virus may contain a second reporter 
minigene which enables separation of the resulting hybrid 
virus and the helper virus upon purification. The 
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construction of desirable helper viruses is within the 
skill of the art. 

As one example, if the cell line employed 
to produce the recombinant virus is not a packaging cell 
5 line, and the hybrid vector contains only the minimum 
adenovirus sequences identified above, the helper virus 
may be a wild type Ad virus. Thus, the helper virus 
supplies the necessary adenovirus early genes El, E2a, E4 
and all remaining late, intermediate, structural and non- 
10 structural genes of the adenovirus genome. However, if, 
in this situation, the packaging cell line is 293, which 
supplies the El proteins, the helper virus need not 
contain the El gene. 

In another embodiment, when the hybrid 
15 construct is rendered replication defective by a 

functional deletion in El but contains no other deletions 
in Ad genes necessary for production of an infective 
viral particle, and the 293 cell line is employed, no 
helper virus is necessary for production of the hybrid 
20 virus. Additionally, all or a portion of the adenovirus 
delayed early gene E3 (which spans mu 76.6 to 86.2) may 
be eliminated from the helper virus useful in this 
invention because this gene product is not necessary for 
the formation of a functioning hybrid virus particle. 
25 It should be noted that one of skill in 

the art may design other helper viruses or develop other 
packaging cell lines to complement the adenovirus 
deletions in the vector construct and enable production 
of the hybrid virus particle, given this information. 
30 Therefore, this invention is not limited by the use or 
description of any particular helper virus or packaging 
cell line. 

Thus, as described in Figs. 1A through 1C, 
the circular plasmid pAd.AV.CMVLacZ [SEQ ID NO: 1] 
35 (containing the minigene and only adenovirus sequences 
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from map unit 0 tc 1 and 9 to 16) was digested and co-r 
transfected with a selected Ad5 helper virus (containing 
adenovirus sequences 9 to 78.4 and 86 to 100) into 293 
cells. Thus, the packaging cell line provides the El 
5 proteins and the helper virus provides all necessary 
adenovirus gene sequences subsequent to map unit 16. 
Homologous recombination occurs between the helper virus 
and the hybrid vector, resulting in the hybrid viral 
particle. Growth of this hybrid viral particle in 293 
10 cells has been closely monitored for greater than 20 
rounds of amplification with no indication of genome 
instability. Rescue and integration of the transgene 
from the hybrid virus into a host cell and further 
modifications of the vector are described below. The 
15 resulting hybrid virus Ad . AV . CKVLacZ combines the high 
titer potential of adenovirus with the integrating 
biology associated with AAV latency. 

G. HvJbrid Virus Polycation C onjugates 

Rep expression is required for rescue of 
20 the rAAV genome to occur. A preferred approach is to 

synthetically incorporate a plasmid permitting expression 
of rep into the hybrid particle. To do so, the hybrid 
viruses described above are further modified by resort to 
adenovirus-poly lysine conjugate technology. See, e.g., 
25 Wu et al, J. Biol. Chem. . 264:16985-16987 (1989); and K. 
J. Fisher and J. M. Wilson, Biochem. J. » 299 ; 49 (April 
1, 1994), incorporated herein by reference. Using this 
technology, a hybrid virus as described above is modified 
by the addition of a poly-cation sequence distributed 
30 around the capsid of the hybrid viral particle. 

Preferably, the poly-cation is poly-lysine, which 
attaches around the negatively-charged virus to form an 
external positive charge. A plasmid containing the AAV 
rep gene (or a functional portion thereof) under the 
35 control of a suitable promoter is then complexed directly 
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to the hybrid capsid, resulting in a single viral 
particle containing the hybrid virus and an AAV rep gene. 
The negatively charged plasmid DNA binds with high 
affinity to the positively charged poly lysine. 
5 Essentially the techniques employed in constructing this 
hybrid virus conjugate or trans-infection particle are as 
described in detail in Example 3 below. 

An alternative embodiment of the hybrid 
vector and resulting viral particle is provided by 

10 altering the rep containing plasmid to also contain an 

AAV cap gene. This embodiment of the hybrid vector when 
in a host cell is thus able to produce a recombinant AAV 
particle, as discussed in more detail below. 

The plasmids employed in these embodiments 

15 contain conventional plasmid sequences, which place a 
selected AAV sequence, i.e., rep and/or cap gene 
sequences, under the control of a selected promoter. In 
the example provided below, the exemplary plasmid is 
pRep78/52 [SEQ ID NO: 2], a trans-acting plasmid 

20 containing the AAV sequences that encode rep 78 kD and 52 
kD proteins under the control of the AAV P5 promoter. 
The plasmid also contains an SV40 polyadenylation signal. 
The DNA sequence of this plasmid is provided in Fig. 8 
[SEQ ID NO: 2]. 

25 In a similar manner and with resort to 

plasmid and vector sequences known to the art, analogous 
plasmids may be designed using both rep and cap genes, 
and different constitutive or regulated promoters, 
optional poly-A sequences and introns. 

30 The availability of materials to make 

these modified hybrid vectors and viruses and the AAV rep 
and/ or cap containing vectors and the techniques involved 
in the assembly of the hybrid vector and rep and/or cap 
containing plasmids are conventional as described above. 

35 The assembly techniques for the trans-infection particle 
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employ the techniques described above for the hybrid 
vector and the techniques of Wu et al and Fisher et al, 
cited above. The use of this trans-infection particle 
including rescue and integration of the transgene into 
5 the host cell is described below; 

II. Function of the Hybrid Virus 

A. The Hybrid Virus Infects a Target Cell 

Once the hybrid virus or trans-infection 

10 particle is constructed as discussed above, it is 

targeted to, and taken up by, a selected target cell. 
The selection of the target cell also depends upon the 
use of the hybrid virus, i.e., whether or not the 
transgene is to be replicated in vitro for production of 

15 a recombinant AAV particle, or ex vivo for production 
into a desired cell type for redelivery into a patient, 
or in vivo for delivery to a particular cell type or 
tissue. Target cells may therefor be any mammalian cell 
(preferably a human cell). For- example, in in vivo use, 

20 the hybrid virus can target to any cell type normally 
infected by adenovirus, depending upon the route of 
administration, i.e., it can target, without limitation, 
neurons, hepatocytes, epithelial cells and the like. 
Uptake of the hybrid virus by the cell is caused by the 

25 infective ability contributed to the vector by the 
adenovirus and AAV sequences. 

B. The Transgene is Rescued. 

Once the hybrid virus or trans-infection 
particle is taken up by a cell, the AAV ITR flanked 

30 transgene must be rescu d from the parental adenovirus 
backbone. Rescue of the transgene is dependent upon 
supplying the infected cell with an AAV rep gene. Thus, 
efficacy of the hybrid virus can be measured in terms of 
rep mediated rescue of rAAV from the parental adenovirus 

35 template. 
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Tha rep genes can be supplied to the 
hybrid virus by several methods. One embodiment for 
providing rep proteins in trans was demonstrated with the 
exemplary hybrid virus Ad.AV.CMVLacZ by transfecting into 
5 the target monolayer of cells previously infected with 

the hybrid vector, a liposome enveloped plasmid pRep78/52 
[SEQ ID NO: 2] containing the genes encoding the AAV rep 
78 kDa and 52 kDa proteins under the control of the AAV 
P5 promoter. Rescue and amplification of a double- 
10 stranded AAV monomer and a double-stranded AAV dimer, 

each containing the LacZ transgene described above , was 
observed in 293 cells. This is described in detail in 
Example 2. 

The production of rep in trans can be 

15 modulated by the choice of promoter in the rep containing 
plasmid. If high levels of rep expression are important 
early for rescue of the recombinant AAV domain, a 
heterologous (non-adenovirus, non-AAV) promoter may be 
employed to drive expression of, rep and eliminate the 

20 need for El proteins. Alternatively, the low levels of 
rep expression from P5 that occur in the absence of 
adenovirus El proteins may be sufficient to initiate 
rescue and optimal to drive integration of the 
recombinant AAV genome in a selected use. 

25 More preferably for in vivo use, the AAV 

rep gene may also be delivered as part of the hybrid 
virus. One embodiment of this single particle concept is 
the polycation conjugated hybrid virus (see Fig. 7) . 
Infection of this trans-infection particle is 

30 accomplished in the same manner and with regard to the 
same target cells as identified above. The poly lysine 
conjugate of the hybrid virus onto which was directly 
complexed a plasmid that encoded the rep 78 and 52 
proteins, combines all of the functional components into 

35 a single particle structure. Thus, the trans-infection 
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particle permits delivery of a single particle to the 
cell, which is considerably more desirable for 
therapeutic use. Similar experiments to demonstrate 
rescue of the transgene from the hybrid conjugate trans- 
infection particle in 293 cells end in HeLa cells are 

detailed in Example 4. 

In another embodiment, the hybrid virus is 

modified by cloning the rep cDNA directly into the 
adenovirus genome portion of the hybrid vector. Because 
it is known that even residual levels of rep expression 
can interfere with replication of adenovirus DNA, such 
incorporation of rep into the hybrid vector itself is 
anticipated to requires possible mutation of the rep 
genes to encode only selected domains, and the use of 
15 inducible promoters to regulate rep expression, as well 

as careful placement of the rep genes into the adenovirus 

sequences of the hybrid vector. 

C. Transaene Integrates in to Chromosome 

Once uncoupled (rescued) from the genome 

20 of the hybrid virus, the recombinant AAV/ transgene 

minigene seeks an integration site in the host chromatin 
and becomes integrated therein, providing stable 
expression of the accompanying transgene in the host 
cell. This aspect of the function of the hybrid virus is 

25 important for its use in gene therapy. The AAV/ 

transgene minigene sequence rescued from the hybrid virus 
achieves provirus status in the target cell, i.e., the 
final event in the hybrid lifecycle (Fig. 7) . 

To determine whether the AAV minigene 

30 rescued from the hybrid virus achieves provirus status in 
a target cell, non-El expressing HeLa cells were infected 
with the hybrid vector-poly-Lysine conjugate complexed 
with pRep78/52 [SEQ ID NO: 2] and passaged until stable 
colonies of LacZ expressing cells are evident. A 

35 duplicate plate of cells was infected with the same 
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conjugate, but instead of being complexed with the " 
pRep78/52 plasmid [SEQ ID NO: 2], carried an irrelevant 
plasmid. Cells that receive the rep containing hybrid 
particle produced a greater number of stable LacZ 
5 positive colonies than cells injected with the control 
vector. This indicates multiple rescue and integration 
events in cells that expressed rep proteins. 
Confirmation of integration is revealed by characterizing 
the recombinant AAV genome in the hybrid infected cells 
10 and identifying flanking chromosomal sequences (see 
Example 5) . 

III. Use of the Hybrid Viruses and Viral Particles 

in Gene Therapy 

15 The novel hybrid virus and trans-infection 

particles of this invention provide efficient gene 
transfer vehicles for somatic gene therapy. These hybrid 
viruses are prepared to contain a therapeutic gene in 
place of the LacZ reporter transgene illustrated in the 

20 exemplary vector. By use of the hybrid viruses and 
trans-infection particles containing therapeutic 
transgenes, these transgenes can be delivered to a 
patient in vivo or ex vivo to provide for integration of 
the desired gene into a target cell. Thus, these hybrid 

25 viruses and trans-infection particles can be employed to 
correct genetic deficiencies or defects. Two examples of 
the generation of gene transfer vehicles for the 
treatment of cystic fibrosis and familial 
hypercholesterolemia are described in Examples 6 and 7 

30 below. One of skill in the art can generate any number 
of other gene transfer vehicles by including a selected 
transgene for the treatment of other disorders. For 
example, the trans-infection particles are anticipated to 
be particularly advantageous in ex vivo gene therapy 
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where transduction and proviral integration in a stem? 
cell is desired , such as in bone marrow directed gene 
therapy . 

The hybrid viruses and trans-infection 
5 particles of the present invention may be administered to 
a patient, preferably suspended in a biologically 
compatible solution or pharmaceutically acceptable 
delivery vehicle. A suitable vehicle includes sterile 
saline. Other aqueous and non-aqueous isotonic sterile 

10 injection solutions and aqueous and non-aqueous sterile 
suspensions known to be pharmaceutically acceptable 
carriers and well known to those of skill in the art may 
be employed for this purpose. 

The hybrid viruses and trans-infection 

15 particles of this invention may be administered in 

sufficient amounts to transfect the desired cells and 
provide sufficient levels of integration and expression 
of the selected transgene to provide a therapeutic 
benefit without undue adverse or with medically 

20 acceptable physiological effects which can be determined 
by those skilled in the medical arts. Conventional and 
pharmaceutically acceptable routes of administration 
include direct delivery to the target organ, tissue or 
site, intranasal, intravenous, intramuscular, 

25 subcutaneous, intradermal, oral and other parental routes 
of administration. Routes of administration may be 

combined , if desired . 

Dosages of the hybrid virus and/ or trans- 
infection particle will depend primarily on factors such 

30 as the condition being treated, the selected gene, the 

age, weight and health of the patient, and may thus vary 
among patients. A therapeutically effective human dose 
of the hybrid viruses or trans-infection particles of the 
present invention is believed to be in the range of from 

35 about 20 to about 50 ml of saline solution containing 
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concentrations of from about 1 x 10 7 to 1 x 10 10 pfu/ml 
hybrid virus of the present invention. A preferred human 
dose is about 20 ml saline solution at the above 
concentrations. The dosage will be adjusted to balance 
5 the therapeutic benefit against any side effects. The 
levels of expression of the selected gene can be 
monitored to determine the selection, adjustment or 
frequency of dosage administration. 

10 IV. High Efficiency Production of rAAV 

The hybrid viruses and trans- infection 
particles of this invention have another desirable 
utility in the production of large quantities of 
recombinant AAV particles. Due to the complicated 

15 current methods for generating AAV, there is only a 

limited amount of AAV available for use in industrial, 
medical and academic biotechnology procedures. The 
vectors and viruses of the present invention provide a 
convenient and efficient method for generating large 

20 quantities of rAAV particles. 

According to this aspect of the invention, a 
trans-infection particle is constructed as described 
above and in Example 3 and is employed to produce high 
levels of rAAV as detailed in Example 8, with the 

25 possible modifications described in Example 9 below. 

Briefly, a plasmid is generated that contains both AAV 
rep and cap genes under the control of a suitable plasmid 
and is complexed to the poly-iysine exterior of the 
hybrid virus as described above. This trans-infection 

30 particle is then permitted to infect a selected host 

cell, such as 293 cells. The presence of both rep and 
cap permit the formation of AAV particles in the cells 
and generate an AAV virus titer of about 10 9 virions. In 
contrast, current methods involving the transfection of 

35 multiple plasmids produce only about 10 7 virion titer. 
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The rAAV is isolated from the culture by selecting the 
LacZ-containing blue plaques and purifying them on a 
cesium chloride gradient. 

The benefit of this procedure relates to the 
5 fact that the cis AAV element is encoded by the parental 
adenovirus genome. As a result, the tretns plasmid is the 
only DNA component that is needed for complex formation. 
The cell is thereby loaded with significantly more copies 
of the trans-acting rep and cap sequences, resulting in 

10 improved efficiency of rescue and packaging. 

Numerous comparative studies focusing on the 
optimal ratio and copy number of the cis and trans 
plasmids for AAV production indicated that there is a 
positive correlation between the trans plasmid copy 

15 number and yield of recombinant virus. As described in 
detail in Example 8, the yield of recombinant AV.CMVLacZ 
virus was increased by 5-10 fold by using the trans- 
infection particle instead of a standard adenovirus 

vector . ^ 

20 The primary limitation associated with the 

production of recombinant AAV using a hybrid virus of 
this invention relates to difficulties that arise in 
distinguishing between the two viruses (i.e., adenovirus 
and AAV) that are produced by the cell. Using the 

25 exemplary vectors and vector components of this 

invention, LacZ histochemical staining could not be used 
to titer the yield of recombinant AV.CMVLacZ since any 
contaminating Ad. AV.CMVLacZ hybrid would contribute to 
the final count. Therefore, a rapid Southern blot 

30 technique for quant itating yields of recombinant AAV was 
incorporated. The assay that was developed enabled not 
only quantitation and verification of AAV production, but 
also demonstrated the removal of contaminating hybrid 
virus from purified AAV stocks. 
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Another method for detecting contaminating 
hybrid virions involves modifying the hybrid vector by 
inserting a small second reporter minigene (i.e., 
reporter gene, promoter and other expression control 
5 sequences, where desired) into vhe E3 region of the 

parental adenovirus backbone. Because this reporter is 
not linked to the AAV domain, contaminating hybrid virus 
that is present during purification can be monitored by 
this hybrid-specific marker. Another possible reporter 

10 gene is the nucleic acid sequence for green fluorescent 

protein. With this hybrid vector containing two reporter 
sequences, histochemical staining for alkaline 
phosphatase (adenovirus reporter) or p-galactosidase (AAV 
reporter) activity can be used to monitor each viral 

15 domain. 

The following examples illustrate the 
construction and testing of the hybrid vectors of the 
present invention and the use thereof in the productions 
of recombinant AAV. These examples are illustrative 
20 only, and do not limit the scope of the present 
invention. 

Example 1 - Construction of a Hybrid Virus 

A first hybrid adenovirus -AAV virus was 

25 engineered by homologous recombination between DNA 

extracted from an adenovirus and a complementing vector 
according to protocols previously described [see, e.g., 
K. F. Kozarsky et al, J. Biol. Chem. . 269 x13695-13702 
(1994) and references cited therein]. The following 

30 description refers to the diagram of Fig. 1. 

Adenovirus DNA was extracted from CsCl purified 
dl7001 virions, an Ad5 (serotype subgroup C) variant that 
carries a 3 kb deletion between mu 78.4 through 86 in the 
nonessential E3 region (provided by Dr. William Wold, 

35 Washington University, St. Louis, Missouri). Adenoviral 
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DNA was prepared for co-transf ection by digestion with 
Clal (adenovirus genomic bp position 917) which removes 
the left arm of the genome encompassing adenovirus map 
units 0-2.5, See lower diagram of Fig. IB. 
5 The complementing hybrid vector, pAd.AV.CMVLacZ 

(see Fig. 1A and Fig. 2 [SEQ ID NO: 1]) was constructed 

as follows: 

A parental cloning vector, pAd.Bglll was 
designed. It contains two segments of wild-type Ad5 

10 genome (i.e., map units 0-1 and 9-16.1) separated by a 
unique Bglll cloning site for insertion of heterologous 
sequences. The missing Ad5 sequences between the two 
domains (adenovirus genome bp 361-3327) results in the 
deletion of Ela and the majority of Elb following 

15 recombination with viral DNA. 

A recombinant AAV genome (AV. CMVLacZ) was 
designed and inserted into the Bglll site of pAd.Bglll to 
generate the complementing plasmid. The linear 
arrangement of AV.CMVLacZ [SEQ ID NO: 1] (see top diagram 

20 of Fig. IB) includes: 

(a) the 5 1 AAV ITR (bp 1-173) obtained by PCR 
using pAV2 [C. A. Laughlin et al, Gene , 23: 65-73 (1983)] 
as template [nucleotide numbers 365-538 of Fig. 2 [SEQ ID 
NO: 1]]; 

25 (b) a CMV immediate early enhancer /promoter 

[Boshart et al, Cell , 41:521-530 (1985); nucleotide 
numbers 563-1157 of Fig. 2 [SEQ ID NO: 1]], 

(c) an SV40 splice donor-splice acceptor 
(nucleotide numbers 1178-1179 of Fig. 2 [SEQ ID NO: 1]), 

30 (d) E. coli beta-galactosidase cDNA 

(nucleotide numbers 1356 - 4827 of Fig. 2 [SEQ ID NO: 

1]), 

(e) an SV40 polyadenylation signal (a 237 Bam 
HI-BclI restriction fragment containing the 
35 cleavage/poly-A signals from both the early and late 
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transcription units; nucleotide numbers 4839 - 5037 of 
Fig. 2 [SEQ ID NO: 1]) and 

(f) 3 'AAV ITR, obtained from pAV2 as a SnaBI- 
Bglll fragment (nucleotide numbers 5053 - 5221 of Fig. 2 

5 [SEQ ID NO: 1]) . 

The resulting complementing hybrid vector , 
pAd.AV.CMVLacZ (see Fig. 1A and Fig. 2 [SEQ ID NO: 1]), 
contained a single copy of recombinant AV.CMVLacZ flanked 
by adenovirus coordinates 0-1 on one side and 9-16.1 on 

10 the other. Plasmid DNA was linearized using a unique 

Nhel site immediately 5' to adenovirus map unit zero (0) 
(resulting in the top diagram of Fig. IB) . 

Both the adenovirus substrate and the 
complementing vector DNAs were transfected to 293 cells 

15 [ATCC CRL1573] using a standard calcium phosphate 

transfection procedure [see, e.g., Sambrook et al, cited 
above] . The end result of homologous recombination 
involving sequences that map to adenovirus map units 9- 
16.1 is hybrid Ad.AV.CMVLacZ (see Fig. 1C) in which the 

20 Ela and Elb coding regions from the dl7001 adenovirus 
substrate are replaced with the AV.CMVLacZ from the 

hybrid vector. 

Twenty-four hours later, the transfection 
cocktail was removed and the cells overlayed with 0.8% 

25 agarose containing lx BME and 2% fetal bovine serum 

(FBS) . Once viral plaques developed (typically 10-12 
days post-transf ection) , plaques were initially screened 
for E. coli p-galactosidase (LacZ) activity by overlaying 
the infected monolayer with agarose supplemented with a 

30 histochemical stain for LacZ, according to the procedure 
described in J. Price et al, Proc. Natl. Acad - sci.. USA, 
84:156-160 (1987). Positive clones (identified by the 
deposit of insoluble blue dye) were isolated, subjected 
to three rounds of freeze (dry ice/ethanol) - thaw (37°C) 

35 and an aliquot of the suspended plaque was used to infect 
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a fresh monolayer of 293 cells seeded on duplicate 60mm 
plates . 

Twenty-four hours later the cells from one set 
of plates were fixed and again stained for LacZ activity. 
5 Cells from the duplicate plate w^re harvested, suspended 
in 0.5 ml 10 mM Tris-Cl, pH8.0, and lysed by performing a 
series of three freeze (dry ice/ ethanol) -thaw (37°C) 
cycles. Cell debris was removed by centrifugation and an 
aliquot of the supernatant used to measure LacZ enzyme 
10 activity. 

As indicated in Fig. 3, assays for p- 
galactosidase activity which measured the absorbance at 
420 nm of the beta-galactosidase blue color in successful 
recombinants, revealed that eight of the ten isolated, 
15 putative positive clones (D1A through D1J) expressed high 
levels of enzyme. Histochemical staining produced 

similar results. 

Large-scale production and purification of 
recombinant virus was performed as described in Kozarsky 
20 et al, cited above, and references cited therein. 

Fyample 2 - Functional Analysis of Hybrid Vector 

The ability to rescue the AV.CMVLacZ sequence 
[SEQ ID NO: 1] from the hybrid virus represented an 
25 important feature of the hybrid vector and virus systems 
of Example 1. To evaluate this feature, it was necessary 
to provide the necessary AAV gene products in trans that 
direct AAV excision and amplification (i.e. rep 
proteins) . Furthermore, this experiment was conducted in 
30 293 cells to transcomplement the El deletion in the 
Ad. AV.CMVLacZ clones, because the adenovirus El gene 
proteins have been shown to be important for initiating 
the lytic phas of the AAV lifecycle. 

293 cells wer s eded onto 6-well 35 mm plates 
35 at a density of 1 x 10 6 cells/well. Twenty-four hours 
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later, seeding media [DMEM/10% FBS supplemented with 
antibiotics] was replaced with 1.0 il DMEM/2% FBS and 
infected with Ad. KV.CMVLacZ hybrid clones at an MOI of 1. 
Two hours later, each well was transfected with 1 /xg 
5 plasmid pRep78/52 [SEQ ID NO: 2j, a trans-acting plasmid 
that encodes the sequence encoding the AAV rep 78 kD and 
52 kD proteins. The rep sequences in this construct are 
under the control of the AAV P5 promoter and utilize an 
SV40 polyadenylation signal. 

10 As a positive control for AAV rescue, 293 cells 

seeded in a 6-well plate as above were co-transfected 
with a cis-acting AAV plasmid pAV. CMVLacZ and pRep78/52. 
pAV.CMVLacZ contained AV.CMVLacZ, the identical sequence 
encoded by pAd . AV . CMVLacZ [SEQ ID NO: 1] described in 

15 Example 1 cloned into the Bglll site of pSP72 (Promega) . 

To provide the necessary adenovirus helper 
function for AAV rescue, cells were infected with either 
wild-type Ad5 virus or a first generation El-deleted 
virus Ad.CMhpAP at an MOI of 5., approximately 2 hours 

20 prior to adding the transfection cocktail. Ad.CMhpAP is 
identical to Ad.CMVLacZ (Example 1) with the modification 
that the alkaline phosphatase sequence (which can be 
obtained from Genbank) is inserted in place of the LacZ 
gene . 

25 Transf ections were performed with Lipof ectamine 

(Life Technologies) according to the instructions 
provided by the manufacturer. Thirty hours post- 
transfection, the cells were harvested and episomal DNA 
(Hirt extract) prepared as described by J. M. Wilson et 

30 al, J- Biol. Chem. , 267 : (16) : 11483-11489 (1992). Samples 
were resolved on a 1.2% agarose gel and electroblotted 
onto a nylon membrane. Blots were hybridized (Southern) 
with a 32 P random primer-labeled restriction fragment 
isolated from the E. coli LacZ cDNA. 
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The full spectrum of duplex molecular species 
that appear during a lytic AAV infection (i.e., monomer ic 
forms of the double stranded intermediates, RFm and RFd, 
respectively) were evident in transfected cells infected 
5 with wild type and El deleted AdJ. No replicative 
intermediates were detected when transf ections were 
performed in the absence of helper virus. 

Hirt extracts from the 293 cells infected with 
putative Ad.AV.CMVLacZ hybrid clones D1A and D1C revealed 

10 a single band corresponding to the viral DNA, when probed 
with a LacZ restriction fragment. In the presence of rep 
proteins 78 and 52, however, the same clones yielded a 
banding pattern that included not only the adenovirus 
DNA, but an RF monomer and dimer of AV.CMVLacZ. A 

15 single-stranded form of AV.CMVLacZ [SEQ ID NO: 1] was not 
evident. Two additional clones gave similar banding 
patterns, DIB and D1H. In all, each of the eight 
Ad . AV . CMVLacZ hybrids that were found in Fig. 3 to 
express high levels of Lac Z activity were positive for 

20 rescue of the AAV domain. 

With the exception of an extra band of 
approximately 3.5 kb, the rescue of the AV.CMVLacZ [SEQ 
ID NO: 1] from the hybrid viral DNA was nearly identical 
to results obtained from a standard cis and trans 

25 plasmid-based approach. In these later samples, 

adenovirus helper function was provided by pre-inf ecting 
cells with either wild-type Ad5 or an El-deleted 
recombinant virus Ad.CBhpAP (also termed H5.CBALP) . The 
Ad.CBhpAP virus has the same sequence as the Ad.CMhpAP 

30 virus described above, except that the CMV promoter 

sequence is replaced by the chicken cytoplasmic B-actin 
promoter [nucleotides -hi to +275 as described in T. A. 
Kost et al, Nucl. Acids R s. . 11(23) :8287 (1983)]. The 
level of rescue in cells infected with WT Ad5 appeared to 

35 be greater relative to those infected with the 
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recombinant Ad.CEhpAP virus, likely due to the additional 
El expression provided by the wild-type genome. The 
relevance of including an El deleted adenovirus here is 
to document that the level of adenovirus El proteins 
5 expressed in 293 cells is sufficient for AAV helper 
function. 

Example 3 - Synthesis of Polvlysine Conjugates 

Another version of the viral particle of this 

10 invention is a polylysine conjugate with a rep plasmid 
complexed directly to the hybrid virus capsid. This 
conjugate permits efficient delivery of the rep 
expression plasmid pRep78/52 [SEQ ID NO: 2] in tandem 
with the hybrid virus, thereby removing the need for a 

15 separate transfection step. See, Fig. 8 for a 
diagrammatic outline of this construction. 

Purified stocks of a large-scale expansion of 
Ad.AV.CMVLacZ clone D1A were modified by coupling poly-L- 
lysine to the virion capsid essentially as described by 

20 K. J. Fisher and J. M. Wilson, Biochem. J. , 299:49-58 

(1994), resulting in an Ad.AV.CMVLacZ-(Lys) n conjugate. 
The procedure involves three steps. First, hybrid 
virions are activated through primary amines on capsid 
proteins with the heterobifunctional water-soluble cross- 

25 linking agent, sulpho-SMCC [sulpho- (N-succinimidyl 4-(N- 
maleimidomethyl) -cyclohexane-l-carboxylate] (Pierce) . 
The conjugation reaction, which contained 0.5 mg (375 
nmol) of sulpho-SMCC and 6 x 10 12 A 260 hybrid vector 
particles in 3.0 ml of HBS, was incubated at 30°C for 45 

30 minutes with constant gentle shaking. This step involved 
formation of a peptide bond between the active N- 
hydroxysuccinimide (NHS) ester of sulpho-SMCC and a free 
amine (e.g. lysine) contributed by an adenovirus protein 
sequence (capsid protein) in the recombinant virus, 

35 yielding a maleimide-activated viral particle. 
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Unincorporated, unreacted cross-linker was 
removed by gel filtration on a 1 cm x 15 cm Bio-Gel P-6DG 
(Bio-Rad Laboratories) column equilibrated with 50 mM 
Tris/HCl buffer, pH 7.0, and 150 mM NaCl. Peak A 260 
5 fractions containing maleimide-activated hybrid virus 
were combined and placed on ice. 

Second, poly-L- lysine having a molecular mass 
of 58 kDa at 10 mg/ml in 50 mM triethanolamine buffer (pH 
8.0), 150 mM NaCl and 1 mM EDTA was thiolated with 2- 

10 imminothiolane/HCl (Traut's Reagent; Pierce) to a molar 
ratio of 2 moles-SH/mole poly lysine under N 2 ; the cyclic 
thioimidate reacts with the poly (L-lysine) primary amines 
resulting in a thiolated polycation. After a 45 minute 
incubation at room temperature the reaction was applied 

15 to a 1 cm x 15 cm Bio-Gel P6DG column equilibrated with 

50 mM Tris/HCl buffer (pH 7.0), 150 mM NaCl and 2 mM EDTA 
to remove unincorporated Traut's Reagent. 

Quantification of free thiol groups was 
accomplished with Ellman's reagent [5,5 , -dithio-bis-(2- 

20 nitrobenzoic acid)], revealing approximately 2 mol of - 
SH/mol of poly (L-lysine) . The coupling reaction was 
initiated by adding 1 x 10 12 A 260 particles of maleimide- 
activated hybrid virus/mg of thiolated poly (L-lysine) and 
incubating the mixture on ice at 4°C for 15 hours under 

25 argon. 2-mercaptoethylamine was added at the completion 
of the reaction and incubation carried out at room 
temperature for 20 minutes to block unreacted maleimide 
sites. 

Virus-poly lysine conjugates, Ad.AV.CMVLacZ- 
30 (Lys) n , were purified away from unconjugated poly(L- 
lysine) by ultracentrifugation through a CsCl step 
gradient with an initial composition of equal volumes of 
1.45 g/ml (bottom step) and 1.2 g/ml (top step) CsCl in 
10 mM Tris/HCl buffer (pH 8.0). Centrif ugation was at 
35 90,000 g for 2 hours at 5°C. The final product was 
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dialyzed against 20 mM Hepes buffer (pH 7.8) containing 
150 mM NaCl (HBS) . 

Complexes of Ad.AV.CMVLacZ- (Lys) n with 
pRep78/52 plasmid DNA [SEQ ID NO: 2] were formed by 
5 adding varying quantities of Ad. AV.CMVLacZ- (Lys) n in 50/xl 
HBS to 0.5 /ig of pRep78/52 plasmid DNA [SEQ ID NO: 2] in 
50/il HBS. After 30 minutes incubation at room 
temperature, a complex was formed of the hybrid virus 
Ad.AV.CMVLacZ-(Lys) n associated in a single particle with 

10 the plasmid DNA containing the rep genes. 

This complex , termed a trans-infection 
particle, was evaluated for DNA binding capacity by gel 
mobility shift assays performed as described in Fisher et 
al, cited above. This analysis revealed that the plasmid 

15 binding capacity of the purified conjugate (expressed as 
the number of A 260 particles Ad.AV.CMVLacZ-(Lys) n that 
can neutralize the charge contributed by 1 /xg plasmid 
DNA) was 1 /xg pRep78/52 plasmid DNA/6.0 x 10 10 A 260 
particles Ad.AV.CMVLac2-(Lys) n .^ 

20 

Example 4 - Trans-Infection Protocol to Demonstrate AAV 
Excision and Amplification 

Trans-infection complexes were prepared by 
mixing Ad.AV.CMVLacZ-(Lys) n conjugate with pRep78/52 

25 plasmid [SEQ ID NO: 2] and applied to 293 cells as 

follows. Ad.AV.CMViacZ-(Lys) n (6 x 10 10 A 260 particles) in 
100 /xl DMEM was added dropwise to a microfuge tube 
containing 1 /xg plasmid DNA in 100 /xl DMEM. The mixture 
was gently mixed and allowed to incubate at room 

30 temperature for 10-15 minutes. The trans-infection 

cocktail was added to 293 cells seeded in a 35 mm 6-well 
as detailed above. Thirty hours later, cells were 
harv sted and Hirt extracts prepared. 



35 



WO 96/13598 



PCT/US95/14018 



38 

Samples were resolved on a 1.2% agarose gel and 
electroblotted onto a nylon membrane. Blots were 
hybridized (Southern) with a P-32 random primer-labeled 
restriction fragment isolated from the E. coli LacZ cDNA. 
5 The Hirt extracts from 293 cells revealed a 

banding pattern that suggested the AV.CMVLacZ minigene 
sequence [SEQ ID NO: 1] was efficiently rescued from the 
hybrid conjugate. Both an RF monomer and dimer of the 
recombinant AV.CMVLacZ sequence were evident. As was 

10 observed previously, the rescue event was dependent on 
rep proteins since 293 cells that were trans-infected 
with a hybrid conjugate complexed with an irrelevant 
reporter plasmid expressing alkaline phosphatase (i.e. 
pCMVhpAP) revealed only Ad.AV.CMVLacZ DNA. This negative 

15 control for rescue was secondarily useful for 

demonstrating the high efficiency of gene transfer to 293 
cells that was achieved with the conjugate vehicle. 

A duplicate set of 293 cells that received 
hybrid conjugate which was further complexed with 

20 alkaline phosphatase expression plasmid were fixed 24 

hours after addition of the trans-infection cocktail and 
histochemically stained for LacZ as described in Price et 
al, cited above, or for alkaline phosphatase activity as 
described in J. H. Schreiber et al, BioTechnioues. 

25 14:818-823 (1993). Here LacZ was a marker for the 

Ad.AV.CMVLacZ hybrid, while alkaline phosphatase served 
as a reporter for the carrier plasmid. Greater than 90% 
of the monolayer was transduced with both p-galactosidase 
and alkaline phosphatase transgenes, showing the high 

30 efficiency of the conjugate delivery vehicle 

(differential staining revealed a blue color for the 
hybrids containing the LacZ marker and a purple color for 
the plasmids bearing the AP marker) . 

Because of the important role El proteins have 

35 for progression of the AAV lifecycle, it was critical to 
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test the efficiency of the hybrid delivery system in a 
setting where El proteins are not expressed. A 
trans-infection experiment using the hybrid conjugate 
complexed with pRep78/52 [SEQ ID NO: 2] was therefore 
5 conducted in HeLa cells [ATCC CCL2] to remove the 
involvement of El proteins. The findings suggested 
rescue of hV.CMVLacZ occurred evidenced by the 
accumulation of RF monomers and dimers. Rescue from HeLa 
cells (which unlike the 293 cells do not contain any 

10 adenovirus El proteins) revealed lower levels of rescue 
of the transgene. The expression of rep from the AAV P5 
promoter is upregulated by adenovirus El and signals the 
beginning of the AAV lytic cycle. In the absence of El, 
rep expression from the P5 promoter is virtually silent 

15 which is important for maintenance of the proviral latent 
stages of the AAV lifecycle. It is anticipated that a 
promoter not dependent on El expression will upon 
substitution for P5, overcome this problem. 

20 Example 5 - Integration of the Transgene 

A preliminary study has been performed to 
determine whether the AAV sequence rescued from the 
hybrid virus can achieve provirus status in a target cell 
(Fig. 7). Briefly, HeLa cells [ATCC CCL 2] were infected 

25 with the hybrid conjugate complexed with pRep78/52 [SEQ 
ID NO: 2] and passaged until stable colonies of LacZ 
expressing cells were evident. A duplicate plate of 
cells was infected with the same conjugate, but instead 
of being complexed with the pRep78/52 plasmid [SEQ ID NO: 

30 2], carried an irrelevant plasmid. These findings 

indicated that cells that received the Rep containing 
hybrid particle produced a greater number of stable LacZ 
positive colonies than cells that were infected with the 
control virus. This could be interpreted as a reflection 

35 of multiple rescue and integration events in cells that 
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expressed Rep proteins. However, it is possible that an 
episomal form of AAV that can persist for extended - 
periods of time was present. 

To establish the occurrence of integration into 
5 the chromosome of the minigene Av.CMVLacZ from the hybrid 
conjugate, the following experiment is performed. The 
Ad.AV.CMVLacZ-(Lys) n conjugate carrying pRep78/52 plasmid 
[SEQ ID NO: 2] is used to infect HeLa cells [ATCC CRL2] 
(primary fibroblasts may also be used) . The infected 
10 cells are passaged for several generations. The cells 
are grown to confluency, split and allowed to grow to 
conf luency again, split again and this cycle repeated as 
desired. This permits sufficient time for uptake, 
expression, replication and integration to occur. See 

15 Fig. 7. 

To verify that the recombinant AAV sequence 
that was rescued from the hybrid genome (step III of Fig. 
7) has integrated into a chromosome of the host cell 
(step IV of Fig. 7), cells are separated by a 

20 Fluorescence Activated Cell Sorter (FACS) . By this 

technique, those cells containing a stable integrated 
copy of the recombinant AV.CMVLacZ minigene are separated 
based on the presence of the (J-galactosidase reporter. 
These cells are tagged with f luorescein-labeled 

25 antibodies that recognize the p-Gal protein, and are then 
separated from non-transduced cells (i.e. those that did 
not receive a copy of the AAV minigene) by FACS. 

DNA is isolated from this purified population 
of cells and used to construct a genomic library which is 

30 screened for individual clones and the sequence verified. 
If integration occurs, it is documented directly by 
sequence analysis. 
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Example 6 - Gene Transfer Vehicle for Cystic Fibrosis 

An adenovirus -AAV-CFTR virus constructed by 
modifying the hybrid Ad.AV.CMVlacZ virus described in 
Example 1 to contain the cystic fibrosis transmembrane 
5 regulator (CFTR) gene [J.R. Rioxdan et al, Science, 

245:1066-1073 (1989)] in place of the lacZ gene, using 
known techniques. One suitable method involves producing 
a new vector using the techniques described in Example 1. 
In this new vector the LacZ minigene is replaced with the 

10 CFTR minigene. For performance of this method vectors 

bearing the CFTR gene have been previously described and 
can be readily constructed. This new or reconstructed 
vector is used to generate a new virus through homologous 
recombination as described above. The resulting hybrid 

15 virus is termed hybrid Ad . AV . CMVCFTf? . It has the 

sequence of Fig. 2 [SEQ ID NO: 1], except that the LacZ 
gene is replaced with CFTR. Alternatively, the LacZ gene 
can be removed from the Ad.AV.CMVLacZ vector of Example 1 
and replaced with the CFTR gene, using known techniques. 

20 This virus (or an analogous hybrid virus with a 

different promoter, regulatory regions, etc.) is useful 
in gene therapy alone, or preferably, in the form of a 
conjugate prepared as described in Example 4. 

Treatment of cystic fibrosis, utilizing the 

25 viruses provided above, is particularly suited for in 
vivo, lung-directed, gene therapy. Airway epithelial 
cells are the most desirable targets for gene transfer 
because the pulmonary complications of CF are usually its 
most morbid and life-limiting. Thus, the hybrid vector 

30 of the invention, containing the CFTR gene, is delivered 
directly into the airway, e.g. by formulating the hybrid 
virus above, into a preparation which can be inhaled. 
For example, the hybrid virus or conjugate of the 
inv ntion containing the CFTR gene, is suspended in 0.25 
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molar sodium chloride. The virus or conjugate is taken 
up by respiratory airway cells and the gene is expressed. 

Alternatively, the hybrid viruses or conjugates 
of the invention may be delivered by other suitable 
5 means, including site-directed injection of the virus 
bearing the CFTR gene. In the case of CFTR gene 
delivery, preferred solutions for bronchial instillation 
are sterile saline solutions containing in the range of 
from about 1 x 10 7 to 1 x 10 10 pfu/ml, more particularly, 

10 in the range of from about 1 x 10 8 to 1 x 10 9 pfu/ml of 
the recombinant hybrid virus of the present invention. 

Other suitable methods for the treatment of 
cystic fibrosis by use of gene therapy recombinant 
viruses of this invention may be obtained from the art 

15 discussions of other types of gene therapy vehicles for 
CF. See, for example, U. S. Patent No. 5,240,846, 
incorporated by reference herein. 

Example 7 - Gene Transfer Ve hicle for Familial 

20 Hypercholesterolemia 

Familial hypercholesterolemia (FH) is an 
autosomal dominant disorder caused by abnormalities 
(deficiencies) in the function or expression of LDL 
receptors [M.S. Brown and J.L. Goldstein, Science, 

25 232(4746) :34-37 (1986); J.L. Goldstein and M.S. Brown, 
"Familial hypercholesterolemia" in Metabolic Basis of 
inherited Disease. , ed. C.R. Scriver et al, McGraw Hill, 
New York, ppl215-1250 (1989).] Patients who inherit one 
abnormal allele have moderate elevations in plasma LDL 

30 and suff r premature life-threatening coronary artery 
disease (CAD) . Homozygous patients have severe 
hypercholesterolemia and life- threatening CAD in 

childhood. 
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A hybrid adenovirus -AAV-LDL virus of the 
invention is constructed by replacing the lacZ gene in 
the hybrid Ad.AV.CMVlacZ virus of Example 1 with an LDL 
receptor gene [T. Yamamoto et al, Cell , 29:27-38 (1984)] 
5 using known techniques and as described analogously for 
CF in the preceding example. Vectors bearing the LDL 
receptor gene can be readily constructed according to 
this invention. The resulting hybrid vector is termed 
pAd.AV.CMVLDL. 

10 This plasmid or its recombinant virus is useful 

in gene therapy of FH alone, or preferably, in the form 
of a viral conjugate prepared as described in Example 4 
to substitute a normal LDL gene for the abnormal allele 
responsible for the gene. 

15 A. Ex Vivo Gene Therapy 

Ex vivo gene therapy can be performed by 
harvesting and establishing a primary culture of 
hepatocytes from a patient. Known techniques may be used 
to isolate and transduce the hepatocytes with the above 

20 vector (s) bearing the LDL receptor gene(s). For example, 
techniques of collagenase perfusion developed for rabbit 
liver can be adapted for human tissue and used in 
transduction. Following transduction, the hepatocytes 
are removed from the tissue culture plates and reinfused 

25 into the patient using known techniques, e.g. via a 
catheter placed into the inferior mesenteric vein. 

B. In Vivo Gene Therapy 

Desirably, the in vivo approach to gene 
therapy, e.g. liver-directed, involves the use of the 

30 hybrid viruses and viral conjugates described above. A 
preferred treatm nt involves infusing a trans-infection 
particle of the invention containing LDL into the 
peripheral circulation of the patient. The patient is 
then evaluated for change in serum lipids and liver 

35 tissues. 
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The hybrid virus or viral conjugate can be 
used to infect hepatocytes in vivo by direct injection 
into a peripheral or portal vein (10 7 -10 8 pfu/kg) or 
retrograde into the biliary tract (same dose) . This 
5 effects gene transfer into the majority of hepatocytes. 

Treatments are repeated as necessary, e.g. 
weekly. Administration of a dose of virus equivalent to 
an MOI of approximately 20 (i.e. 20 pfu/hepatocyte) is 
anticipated to lead to high level gene expression in the 
10 majority of hepatocytes. 

Example 8 - Efficient Production of Recombina nt AAV usim 
A Hybrid Virus /Conjugate 

The following experiment demonstrated that the 

15 AAV genome that was rescued from the Ad . AV . CtWLacZ hybrid 
virus could be packaged into an AAV capsid, provided the 
cap open reading frame was supplied in trans. Thus the 
viruses of this invention are useful in a production 
method for recombinant AAV which, overcomes the prior art 

20 complications that surround the high titer production of 

recombinant AAV. 

A. Trans-Infection Protoco l for the 

Production of rAAV 

A trans-infection complex was formed 

25 composed of the Ad.AV.CMVLacZ-(Lys) n conjugate described 
above and a transcomplementing plasmid pAdAAV, which is 
described in detail in R. J. Samulski et al, J. Virol. , 
63(9) :3822-3828 (1989)]. Briefly, plasmid pAdAAV encodes 
the entire rep and cap open reading frames in the absence 

30 of AAV ITRs, and has been shown to provide the necessary 
AAV helper functions for replication and packaging of 

recombinant AAV sequences. 

Ad.AV.CMVLacZ-(Lys) n conjugate (4.5 x 10 13 
A 260 particles) in 75 ml DMEM was added dropwise with 
35 constant gentle swirling in 25 ml DMEM containing 750 ng 
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pAdAAV helper pl&smid and incubated at room temperature 
for 10-15 minutes. The complex was diluted with 450 ml 
DMEM supplemented with 2% FBS and 20 ml aliquots were 
added to monolayers of 293 cells seeded on 150 mm plates. 
5 Forty hours post trant*-inf ection, cells were 

harvested, suspended in 12 ml 10 mM Tris-Cl (pH 8.0), and 

stored at -80°C. 

Because the anticipated outcome was the 

production of hybrid virus Ad . AV . CMVLacZ and a 

10 recombinant AAV virion (AV.CMVLacZ) , both of which carry 
a functional LacZ minigene, it was not possible to use 
detection of LacZ activity as an indicator of AV.CMVLacZ 
production. A novel molecular approach was developed 
that could be performed in one day and permitted 

15 identification of the packaged viral DNAs. 

B. Purification of rAAV 

Briefly, frozen cell suspensions were 
subjected to three rounds of freeze-thaw cycles to 
release recombinant AV.CMVLac^ and hybrid Ad. AV.CMVLacZ. 

20 On completion of the final thaw, bovine pancreatic DNAs 
(2000 units) and ribonuclease (0.2 mg/ml final 
concentration) was added and the extract incubated at 
37 °C for 30 minutes. Cell debris was removed by 
centrifugation (5000xg for 10 minutes) and the clarified 

25 supernatant (15 ml) applied to a 22.5 ml step gradient 

composed of equal volumes of CsCl at 1.2 g/ml, 1.36 g/ml, 
and 1.45 g/ml lOmM Tris-Cl, pH8.0. Viral particles were 
banded at 25,000 rpm in a Beckman SW-28 rotor for 8 hours 
at 4°C. One ml fractions were collected from the bottom 

30 of the tube. 

The fractions retrieved from the CsCl 
gradient of partially purified virus are then digested to 
release viral DNA from virion capsids as follows. A 
5.0/xl sample of each fraction was transferred to a 
35 microfuge tube containing 20 fil capsid digestion buffer 
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(50mM Tris-Cl, pHC.O, l.OmM EDTA, pH8,0, 0.5% SDS, and 
1.0 mg/ml Proteinase K) . The reaction was incubated at 
50 °C for 1 hour, allowed to cool to room temperature, 
diluted with 10 pi milli-Q water, and agarose gel loading 
5 dye added. 

These fractions are then analyzed by 
Southern blotting. Samples were resolved on a 1.2% 
agarose gel, electrob lotted onto a nylon membrane. A 32 P 
labeled LacZ restriction fragment which was common to 
10 both vectors was used as a hybridization probe to locate 
the migration of viral DNA through the agarose gel. 
Viral bands were quantitated on a Molecular Dynamics 
Pho spho imager . 

A sample of the extract before CsCl 

15 banding was also tested and revealed both hybrid 

Ad.AV.CMVLacZ DNA and double-stranded RF forms (monomers 
and dimers) of the rescued AV.CMVLacZ sequence [SEQ ID 
NO: 1]. A single-stranded monomer of AV.CMViacZ appeared 
to be present in the crude extract; however, it was not 

20 until the virions were concentrated by buoyant density 
ultracentrifugation that the single-stranded genome 
became clearly evident. The single-stranded recombinant 
genome of the virus was distributed over a range of CsCl 
densities and revealed a biphasic banding pattern. The 

25 two peaks of single- stranded rAAV genome occurred at 

densities of 1.41 and 1.45 g/ml CsCl, consistent with the 
reported buoyant densities of wild-type AAV in CsCl [L. 
M. de la Maza et al, J. Virol. . 33:1129-1137 (1980)]. 
Analysis of the fractions corresponding to the two vector 

30 forms revealed the rAAV-1.41 species was several orders 
of magnitude more active for lacZ transduction than the 
denser rAAV-1.45 g/ml variant. To avoid confusion with 
contaminating Ad. AAV, samples were heat inactivated (60°C 
for 30 min) before being added to indicator HeLa cells. 
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The peak fractions of rAAV-1.41 were 
combined and purified by equilibrium sedimentation in 
CsCl to eliminate residual adenovirus particles and 
concentrate rAAV virions. On the final round of 
5 ultracentrifugation, a faint buc clearly visible 
opalescent band was observed in the middle of the 
gradient tube. Fractions that surrounded the band were 
evaluated for density, absorbance at 260 nm, and lacZ 
transducing particles. As the band eluted from the 

10 gradient tube, a well defined peak of 260 nm absorbing 
material was recorded, with a maximal absorbance 
occurring at a density of 1.40 g/ml CsCl. Analysis for 
lacZ transducing particles on HeLa cells revealed a peak 
of activity that mirrored the absorbance profile. These 

15 results indicate rAAV was produced from the hybrid Ad. AAV 
virus. Furthermore, the titers achieved using the hybrid 
virus were 5-10 fold elevated compared to more 
conventional recombinant AAV production schemes (i.e., 
transf ections with cis- and trans-acting plasmids) . This 

20 represents a significant improvement in rAAV production 
and indicates that the hybrid is useful for large-scale 



rAAV production. 

All references recited above are incorporated 

herein by reference. Numerous modifications and 
25 variations of the present invention are included in the 
above-identified specification and are expected to be 
obvious to one of skill in the art. Such modifications 
and alterations to the compositions and processes of the 
present invention, such as those modifications permitting 
30 optimal use of the hybrid viruses as gene therapy 

vehicles or production vehicles for recombinant AAV 
production, are believed to b encompassed in the scope 
of the claims appended hereto. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Trustees of the University of Pennsylvania 

Wilson, James M. 
Kelley, William M. 
Fisher, Krishna J. 

(ii) TITLE OF INVENTION: Hybrid Adenovirus -AAV Vector and 

Methods of Use Thereof 

(iii) NUMBER OF SEQUENCES: 2 
(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Howson and Howson 

(B) STREET: Spring House Corporate Cntr, PO Box 457 

(C) CITY: Spring House 

(D) STATE: Pennsylvania 

(E) COUNTRY: USA 

(F) ZIP: 19477 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER : IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(Vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/331,384 

(B) FILING DATE: 28-OCT-1994 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Bak, Mary E. 

(B) REGISTRATION NUMBER: 31,215 

(C) REFERENCE/ DOCKET NUMBER: GNVPN . 007PCT 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 215-540-9200 

(B) TELEFAX: 215-540-5818 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10398 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



GAATTCGCTA 


GCATCATCAA 


TAATATACCT 


T A r T , f ,r PTGG A T 1 

1A1 XXX w wx\ X 


TGA AGPPA AT 


en 


ATGATAATGA 


GGGGGTGGAG 


TTTGTGACGT 


Www www www 


GTGGG A A PGG 

VJ X w w wXix\wVJw 


i nn 


GGCGGGTGAC 


GTAGTAGTGT 


GGCGGAAGTG 


TGATGTTGPA 

X u/i X w X X w wXl 


A GTGTGG PGG 

XiVJ X w* X ww www* 


1 

X>*7 LI 


AACACATGTA 


AGCGACGGAT 


GTGGCAAAAG 

W X VJVrf WXlX^X&XlW 


X VJnUU X X X X X 


GGTGTGPGPP 

UVJlU X OwOww 




GGTGTACACA 


GGAAGTGACA 


ATTTTCGCGP 

X J. X ww^www 


GGTTTTT 1 A GflP 
wwl X X XnuuL 


fin a TfiTTP t a 

V?un X V» X X \s X A 


OCA 
ZjU 


GTAAATTTGG 


GCGTAACCGA 


GTAAGATTTG 

w X XUlwA XXX VJ 


gppattttp^ 

uUWil XXX Lb 






GAATAAGAGG 


AAGTGAAATC 


TGAATAATTT 

x wxiun x xvn x x x 


IVjiui Xrlw X w 


A T A flCCl HfW A 




ATATTTGTCT 


AGGGAGATCT 


GCTGCGCGCT 


wVJ w X www X wXi 


w X uAVjuLUuL 


Ann 


CCGGGCAAAG 


CCCGGGCGTC 


GGGCGACCTT 


TGGTCGCCCG 


GCCTCAGTGA 


450 


GCGAGCGAGC 


GCGCAGAGAG 


GGAGTGGCCA 


ACTCCATCAC 


TAGGGGTTCC 


500 


TTGTAGTTAA 


TGATTAACCC 


GCCATGCTAC 


TTATCTACAA 


TTCGAGCTTG 


550 


CATGCCTGCA 


GGTCGTTACA 


TAACTTACGG 


TAAATGGCCC 


GCCTGGCTGA 


600 


CCGCCCAACG 


ACCCCCGCCC 


ATTGACGTCA 


ATAATGACGT 


ATGTTCCCAT 


650 


AGTAACGCCA 


ATAGGGACTT 


TCCATTGACG 


TCAATGGGTG 


GAGTATTTAC 


700 


GGTAAACTGC 


CCACTTGGCA 


GTACATCAAG 


TGTATCATAT 


GCCAAGTACG 


750 


CCCCCTATTG 


ACGTCAATGA 


CGGTAAATGG 


CCCGCCTGGC 


ATTATGCCCA 


800 


GTACATGACC 


TTATGGGACT 


TTCCTACTTG 


GCAGTACATC 


TACGTATTAG 


850 


TCATCGCTAT 


TACCATGGTG 


ATGCGGTTTT 


GGCAGTACAT 


CAATGGGCGT 


900 


GGATAGCGGT 


TTGACTCACG 


GGGATTTCCA 


AGTCTCCACC 


CCATTGACGT 


950 


CAATGGGAGT 


TTGTTTTGGC 


ACCAAAATCA 


ACGGGACTTT 


CCAAAATGTC 


1000 



t 
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GTAACAACTC CGCCCCATTG ACGCAAATGG GCGGTAGGCG TGTACGGTGG 1050 

GAGGTCTATA TAAGCAGAGC TCGTTTAGTG AACCGTCAGA TCGCCTGGAG 1100 

ACGCCATCCA CGCTGTTTTG ACCTCCATAG AAGACACCGG GACCGATCCA 1150 

GCCTCCGGAC TCTAGAGGAT CCGGTACTCG AGGAACTGAA AAACCAGAAA 1200 

GTTAACTGGT AAGTTTAGTC TTTTTGTCTT TTATTTCAGG TCCCGGATCC 1250 

GGTGGTGGTG CAAATCAAAG AACTGCTCCT CAGTGGATGT TGCCTTTACT 1300 

TCTAGGCCTG TACGGAAGTG TTACTTCTGC TCTAAAAGCT GCGGAATTGT 1350 

ACCCGCGGCC GCAATTCCCG GGGATCGAAA GAGCCTGCTA AAGCAAAAAA 1400 

GAAGTCACCA TGTCGTTTAC TTTGACCAAC AAGAACGTGA TTTTCGTTGC 1450 

CGGTCTGGGA GGCATTGGTC TGGACACCAG CAAGGAGCTG CTCAAGCGCG 1500 

ATCCCGTCGT TTTACAACGT CGTGACTGGG AAAACCCTGG CGTTACCCAA 1550 

CTTAATCGCC TTGCAGCACA TCCCCCTTTC GCCAGCTGGC GTAATAGCGA 1600 

AGAGGCCCGC ACCGATCGCC CTTCCCAACA GTTGCGCAGC CTGAATGGCG 1650 

AATGGCGCTT TGCCTGGTTT CCGGCACCAG AAGCGGTGCC GGAAAGCTGG 1700 

CTGGAGTGCG ATCTTCCTGA GGCCGATACT GTCGTCGTCC CCTCAAACTG 1750 

GCAGATGCAC GGTTACGATG CGCCCATCTA CACCAACGTA ACCTATCCCA 1800 

TTACGGTCAA TCCGCCGTTT GTTCCCACGG AGAATCCGAC GGGTTGTTAC 1850 

TCGCTCACAT TTAATGTTGA TGAAAGCTGG CTACAGGAAG GCCAGACGCG 1900 

AATTATTTTT GATGGCGTTA ACTCGGCGTT TCATCTGTGG TGCAACGGGC 1950 

GCTGGGTCGG TTACGGCCAG GACAGTCGTT TGCCGTCTGA ATTTGACCTG 2000 

AGCGCATTTT TACGCGCCGG AGAAAACCGC CTCGCGGTGA TGGTGCTGCG 2050 

TTGGAGTGAC GGCAGTTATC TGGAAGATCA GGATATGTGG CGGATGAGCG 2100 

GCATTTTCCG TGACGTCTCG TTGCTGCATA AACCGACTAC ACAAATCAGC 2150 

GATTTCCATG TTGCCACTCG CTTTAATGAT GATTTCAGCC GCGCTGTACT 2200 

GGAGGCTGAA GTTCAGATGT GCGGCGAGTT GCGTGACTAC CTACGGGTAA 2250 

CAGTTTCTTT ATGGCAGGGT GAAACGCAGG TCGCCAGCGG CACCGCGCCT 2300 
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TTCGGCGGTG AAATTATCGA TGAGCGTGGT 
ACTACGTCTG AACGTCGAAA ACCCGAAACT 
ATCTCTATCG TGCGGTGGTT GAACTGCACA 
GAAGCAGAAG CCTGCGATGT CGGTTTCCGC 
TCTGCTGCTG CTGAACGGCA AGCCGTTGCT 
ACGAGCATCA TCCTCTGCAT GGTCAGGTCA 
CAGGATATCC TGCTGATGAA GCAGAACAAC 
GCATTATCCG AACCATCCGC TGTGGTACAC 
TGTATGTGGT GGATGAAGCC AATATTGAAA 
AATCGTCTGA CCGATGATCC GCGCTGGCTA 
AACGCGAATG GTGCAGCGCG ATCGTAATCA 
CGCTGGGGAA TGAATCAGGC CACGGCGCTA 
TGGATCAAAT CTGTCGATCC TTCCCGCCCG 
AGCCGACACC ACGGCCACCG ATATTATTTG 
ATGAAGACCA GCCCTTCCCG GCTGTGCCGA 
CTTTCGCTAC CTGGAGAGAC GCGCCCGCTG 
CGCGATGGGT AACAGTCTTG GCGGTTTCGC 
GTCAGTATCC CCGTTTACAG GGCGGCTTCG 
TCGCTGATTA AATATGATGA AAACGGCAAC 
TGATTTTGGC GATACGCCGA ACGATCGCCA 
TCTTTGCCGA CCGCACGCCG CATCCAGCGC 
CAGCAGTTTT TCCAGTTCCG TTTATCCGGG 
CGAATACCTG TTCCGTCATA GCGATAACGA 
CGCTGGATGG TAAGCCGCTG GCAAGCGGTG 
CCACAAGGTA AACAGTTGAT TGAACTGCCT 
CGCCGGGCAA CTCTGGCTCA CAGTACGCGT 



GGTTATGCCG ATCGCGTCAC 2350 

GTGGAGCGCC GAAATCCCGA 2400 

CCGCCGACGG CACGCTGATT 2450 

GAGrxJTGCGGA TTGAAAATGG 2500 

GATTCGAGGC GTTAACCGTC 2550 

TGGATGAGCA GACGATGGTG 2600 

TTTAACGCCG TGCGCTGTTC 2650 

GCTGTGCGAC CGCTACGGCC 2700 

CCCACGGCAT GGTGCCAATG 2750 

CCGGCGATGA GCGAACGCGT 2800 

CCCGAGTGTG ATCATCTGGT 2850 

ATCACGACGC GCTGTATCGC 2900 

GTGCAGTATG AAGGCGGCGG 2950 

CCCGATGTAC GCGCGCGTGG 3000 

AATGGTCCAT CAAAAAATGG 3050 

ATCCTTTGCG AATACGCCCA 3100 

TAAATACTGG CAGGCGTTTC 3150 

TCTGGGACTG GGTGGATCAG 3200 

CCGTGGTCGG CTTACGGCGG 3250 

GTTCTGTATG AACGGTCTGG 3300 

TGACGGAAGC AAAACACCAG 3350 

CAAACCATCG AAGTGACCAG 3400 

GCTCCTGCAC TGGATGGTGG 3450 

AAGTGCCTCT GGATGTCGCT 3500 

GAACTACCGC AGCCGGAGAG 3550 

AGTGCAACCG AACGCGACCG 3600 
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CATGGTCAGA AGCCGGGCAC ATCAGCGCCT 
GAAAACCTCA GTGTGACGCT CCCCGCCGCG 
GACCACCAGC GAAATGGATT TTTGCATCGA 
AATTTAACCG CCAGTCAGGC TTTCTTTCAC 
AAACAACTGC TGACGCCGCT GCGCGATCAG 
TAACGACATT GGCGTAAGTG AAGCGACCCG 
TCGAACGCTG GAAGGCGGCG GGCCATTACC 
CAGTGCACGG CAGATACACT TGCTGATGCG 
CGCGTGGCAG CATCAGGGGA AAACCTTATT 
GGATTGATGG TAGTGGTCAA ATGGCGATTA 
AGCGATACAC CGCATCCGGC GCGGATTGGC 
GGTAGCAGAG CGGGTAAACT GGCTCGGATT 
CCGACCGCCT TACTGCCGCC TGTTTTGACC 
GACATGTATA CCCCGTACGT CTTCCCGAGC 
GACGCGCGAA TTGAATTATG GCCCACACCA 
TCAACATCAG CCGCTACAGT CAACAGCAAC 
CATCTGCTGC ACGCGGAAGA AGGCACATGG 
TATGGGGATT GGTGGCGACG ACTCCTGGAG 
TACAGCTGAG CGCCGGTCGC TACCATTACC 
TAATAATAAC CGGGCAGGCC ATGTCTGCCC 
CATTATGTAC TATTTAAAAA ACACAAACTT 
TTTTCTTTTA CTTTTTTATC ATGGGAGCCT 
TGGCTACATG ACATCAACCA TATCAGCAAA 
TGCCGCTATT TCTCTGTTCT CGCTATTATT 
TTTCTGACAA ACTCGGCCTC GACTCTAGGC 
GATAAGATAC ATTGATGAGT TTGGACAAAC 



GGCAGCAGTG 


(jr ww- 1L1 l^V? wo- 


J \J *J w 


TCCCACGCCA 


rn OOOO O 7A TPT 


^700 


GCTGGGTAAT 


AAGww'l 1 w-VjU 


3750 


AG AIL. I I 


tppp^iata a a 

1. ww wwfl J. AAA 


3800 

*J w w w 


TTCAC w ww- 1 u 


wAw www J. wvJA 


3850 


CATTGACCC 1 


a a r*r pptppp 

AAwljww 1 VJVJVJ 


3900 


IV f+ f^f^f* tv a o 

AGGCCGAAGC 


Avj^ut 1 IbJ. iO 


3950 


GTGCTGATTA 


00 a. OOP PTP A 


4000 


TATCAGCCGG 


AAA APPTAPP 
AAAAww 1 Aww 


4050 

*Z \J ~J \J 


CCGTTGATGT 


1 (jAAu 1 ubtu 


4100 

t M \J V 


ii/^ iv a ^mo f>r> 
CTGAACTGCw 


7A PPT^PPPP-P A 


4150 


AGGGCwGwAA 


PA A A APT ATP 


4200 

At w w 


/"»m/** A mom 

GCTGGGA 1L1 


PPP ATTPTPA 


4250 


TV TV A A OOOmO 

G AAAACJGij 1 w 


TPPPPTP.PP.rZ 


4300 


GTGGCGCGGC 


O A. OTTOOAO.T 


4350 


iv m^^* iv iv iv o 

TGATGGAAAC 


O A O OP A TPP.P 


4400 


CTGAATATCG 


IV OMlflUMPO A 

ACGG 111 w wA 


4450 


CCCGTCAGTA 


TCuuLbbAAl 


4500 


AGTTGGTCTG 


omomoA Ik A A A 
G I G 1 IAAAAA 


4550 


GTATTTCGCG 


m A AOO A A ATP 
TAAbbiiAAl w 


4600 

*x \J v/ v/ 


TTGGATGTTC 


GGTTTATTCT 


4650 


ACTTCCCGTT 


TTTCCCGATT 


4700 


AGTGATACGG 


GTATTATTTT 


4750 


CCAACCGCTG 


TTTGGTCTGC 


4800 


GGCCGCGGGG 


ATCCAGACAT 


4850 


CACAACTAGA 


ATGCAGTGAA 


4900 
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AAAAATGCTT TATTTGTGAA ATTTGTGATG 
ATTATAAGCT GCAATAAACA AGTTAACAAC 
GTTTCAGGTT CAGGGGGAGG TGTGGGAGGT 
CGAGTAGATA AGTAGCATGG CGGGTTAATC 
AGTGATGGAG TTGGCCACTC CCTCTCTGCG 
CCGGGCGACC AAAGGTCGCC CGACGCCCGG 
GTGAGCGAGC GAGCGCGCAG CAGATCTGGA 
ACCCGCACCA GGTGCAGACC CTGCGAGTGT 
CCAGCCTGTG ATGCTGGATG TGACCGAGGA 
TGCTGGCCTG CACCCGCGCT GAGTTTGGCT 
TGAGGTACTG AAATGTGTGG GCGTGGCTTA 
AGGTGGGGGT CTTATGTAGT TTTGTATCTG 
CCATGAGCAC CAACTCGTTT GATGGAAGCA 
ACGCGCATGC CCCCATGGGC CGGGGTGCGT 
CATTGATGGT CGCCCCGTCC TGCCCGCAAA 
AGACCGTGTC TGGAACGCCG TTGGAGACTG 
GCCGCTGCAG CCACCGCCCG CGGGATTGTG 
CCCGCTTGCA AGCAGTGCAG CTTCCCGTTC 
TGACGGCTCT TTTGGCACAA TTGGATTCTT 
GTTTCTCAGC AGCTGTTGGA TCTGCGCCAG 
TTCCTCCCCT CCCAATGCGG TTTAAAACAT 
TTTGGATTTG GATCAAGCAA GTGTCTTGCT 
CGCGCGCGGT AGGCCCGGGA CCAGCGGTCT 
TATTTTTTCC AGGACGTGGT AAAGGTGACT 
GCATAAGCCC GTCTCTGGGG TGGAGGTAGC 
TGCGGGGTGG TGTTGTAGAT GATCCAGTCG 



CTATTGCTTT ATTTGTAACC 4950 

AACAATTGCA TTCATTTTAT 5000 

TTTTTCGGAT CCTCTAGAGT 5050 

ATTAACTACA AGGAACCCCT 5100 

CGCTCGCTCG CTCACTGAGG 5150 

GCTTTGCCCG GGCGGCCTCA 5200 

AGGTGCTGAG GTACGATGAG 5250 

GGCGGTAAAC ATATTAGGAA 5300 

GCTGAGGCCC GATCACTTGG 5350 

CTAGCGATGA AGATACAGAT 5400 

AGGGTGGGAA AGAATATATA 5450 

TTTTGCAGCA GCCGCCGCCG 5500 

TTGTGAGCTC ATATTTGACA 5550 

CAGAATGTGA TGGGCTCCAG 5600 

CTCTACTACC TTGACCTACG 5650 

CAGCCTCCGC CGCCGCTTCA 5700 

ACTGACTTTG CTTTCCTGAG 5750 

ATCCGCCCGC GATGACAAGT 5800 

TGACCCGGGA ACTTAATGTC 5850 

CAGGTTTCTG CCCTGAAGGC 5900 

AAATAAAAAA CCAGACTCTG 5950 

GTCTTTATTT AGGGGTTTTG 6000 

CGGTCGTTGA GGGTCCTGTG 6050 

CTGGATGTTC AGATACATGG 6100 

ACCACTGCAG AGCTTCATGC 6150 

TAGCAGGAGC GCTGGGCGTG 6200 
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GTGCCTAAAA ATGTCTTTCA CTAGCAAGCT 
TGGTGTAAGT GTTTACAAAG CGGTTAAGCT 
GATATGAGAT GCATCTTGGA CTGTATTTTT 
CATATCCCTC CGGGGATTCA TGTTGTGCAG 
CGGTGCACTT GGGAAATTTG TCATGTAGCT 
AACTTGGAGA CGCCCTTGTG ACCTCCAAGA 
AATGATGGCA ATGGGCCCAC GGGCGGCGGC 
GATCACTAAC GTCATAGTTG TGTTCCAGGA 
TTTACAAAGC GCGGGCGGAG GGTGCCAGAC 
CGGCCCAGGG GCGTAGTTAC CCTCACAGAT 
GTTCAGATGG GGGGATCATG TCTACCTGCG 
TCCGGGGTAG GGGAGATCAG CTGGGAAGAA 
CGACTTACCG CAGCCGGTGG GCCCGTAAAT 
ACTGGTAGTT AAGAGAGCTG CAGCTGCCGT 
ACTTCGTTAA GCATGTCCCT GACTCGCATG 
CAGAAGGCGC TCGCCGCCCA GCGATAGCAG 
TTTTCAACGG TTTGAGACCG TCCGCCGTAG 
CCAAGCAGTT CCAGGCGGTC CCACAGCTCG 
TCGATCCAGC ATATCTCCTC GTTTCGCGGG 
CGGCAGTAGT CGGTGCTCGT CCAGACGGGC 
GGCGCAGGGT CCTCGTCAGC GTAGTCTGGG 
CCGGGCTGCG CGCTGGCCAG GGTGCGCTTG 
GAAGCGCTGC CGGTCTTCGC CCTGCGCGTC 
TGGTGTCATA GTCCAGCCCC TCCGCGGCGT 
CCCTTGGAGG AGGCGCCGCA CGAGGGGCAG 
GAGCTTGGGC GCGAGAAATA CCGATTCCGG 



ci 2v r n r pcr , p agg 


GGCAGGCCCT 


6250 


GGGATGGGTG 

VJwUA X www X w 


CATACGTGGG 


6300 


AwwX lwVJWin 


TGTTCCCAGC 


6350 


ru\wwnwwAw w 


ACAGTGTATC 


6400 


TAGAAGGAAA 


TGCGTGGAAG 


6450 


X X X X ww A X w w 


ATTCGTCCAT 

X*s X X W^J A W W*» * 


6500 


PTCCGPfi A AG 
w X wVJUwVJnnu 


ATATTTCTGG 

XI X *» X X A W A 


6550 


TG&GATPGTP 
x unun X wu X w 


ATAGGCCATT 

f\xnwwwwf&^ a 


6600 


X w ww w x A x An 


TGGT'PPPATC 

X ww X X w vA X W 


6650 


1 1 (JWAI 1 1 WW 


PAPGPTTTGA 

wAwwwX X x wn. 


6700 




{1H & A APGGTT 

w/uulAwwV? X X 


6750 


AbwAwU 1 J. ww 


TGAGPAGCTG 

X w/iw UAw w X w 


6800 


wAwAww XriX X 


A PPGGGTGCA 

£\w www w X w wA 


6850 


paTpppTcan 

wA X WWW X W\rtAJ 


CAGGGGGGCC 

wflw w w w^wVJ WW 


6900 


X X X X www X un 


PPAAATCCGC 

Uvruw* x wwww 


6950 


TTPTTHPAAG 
X X w X X wUAnvj 


GAAGCAAAGT 


7000 


rp a TG PTTTT 

VjvAI wwX XXX 


GAGCGTTTGA 

w,flV7 W^7 X A A XJ** 


7050 


GTP A CC^GC^ 

w X wAw w X ww X 


PTACGGCATC 


7100 


X X www w www w 


TTTCGCTGTA 


7150 


P A GGGTP A TG 
Lavjuu X vn X w 


TPTTTCCACG 

X w XXX w vnv w 


7200 

9 mm 


TCACGGTGAA 


GGGGTGCGCT 


7250 


AGGCTGGTCC 


TGCTGGTGCT 


7300 


GGCCAGGTAG 


CATTTGACCA 


7350 


GGCCCTTGGC 


GCGCAGCTTG 


7400 


TGCAGACTTT 


TGAGGGCGTA 


7450 


GGAGTAGGCA 


TCCGCGCCGC 


7500 
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AGGCCCCGCA 


GACGGTCTCG 


CATTCCACGA 


GCCAGGTGAG 


CTCTGGCCGT 


7550 


fri oo /^f^m/** tv tv 

TCGGGGTCAA 


tv 14 iv ooiv oo mm 

AAACCAGGTT 


TCCCCCATGC 


TTTTTGATGC 


GTTTCTTACC 


7600 


TCTGGTTTCC 


tv mo tv /■» /"^ 

ATGAGCCGGT 


GTCCACGCTC 


GGTGACGAAA 


AGGCTGTCCG 


7650 


momooooom tv 

TGTCCCCGTA 


m tv o tv o ta ommo 

TACAGACTTG 


iv o iv I oo^^ I omom 

AGAGGCCTGT 


CCTcGACCGA 


TGCCCTTGAG 


7700 


tv rnomrrpA 7a o 




oommoooomo 

CCTTCCGGTG 


o o oo ,00 o o ^* 

GGCGCGGGGC 


ATGACTATCG 


7750 




m tv mo TA omomo 


TTCTTTATCA 


mooiv iv om^i^^m 

TGCAACTCGT 


AGGACAGGTG 


7800 


ooo o o 7v o oo o 


momooomoTv m 

TCTGGGTCAT 


mfnmooooo iv o 

TTTCGGCGAG 


GACCGCTTTC 


GCTGGAGCGC 


7850 


o 7v oo tv mo tv mo 

GACGATGATC 


o o o omo m oo o 

GGCCTGTCGC 


fnmoooofn* mm 

TTGCGGTATT 


CGGAATCTTG 


CACGCCCTCG 


7900 


CTCAAGCCTT 


oonn^viv omoom 

CGTCACTGGT 


CCCGCCACCA 


AACGTTTCGG 


CGAGAAGCAG 


7950 


o oo TV mm tv moo 

GCCATTATCG 


o oo o o tv mo o o 

CCGGCATGGC 


GGCCGACGCG 


CTGGGCTACG 


TCTTGCTGGC 


8000 


ommoo oo a oo 


CGAGGCTGGA 


TGGCCTTCCC 


CATTATGATT 


CTTCTCGCTT 


8050 


CCGGCGGCAT 


fyr* O T\ mo ooo 

CGGGATGCCC 


GCGTTGCAGG 


CCATGCTGTC 


CAGGCAGGTA 


8100 


O TV mo TV OO TA OO 

GATGACGACC 


TV ^n^v TV TV V 

ATCAGGGACA 


GCTTCAAGGA 


TCGCTCGCGG 


CTCTTACCAG 


8150 


ooititv t\ ommoo 

CCTAACTTCG 


ATCACTGGAC 


CGCTGATCGT 


CACGGCGATT 


TATGCCGCCT 


8200 


tv ootv o 

CGGCGAGCAC 


iv moo iv iv oooo 

ATGGAACGGG 


TTGGCATGGA 


TTGTAGGCGC 


CGCCCTATAC 


8250 


ommofnorno oo 

CTTGTCTGCC 


TCCCCGCGTT 


GCGTCGCGGT 


GCATGGAGCC 


GGGCCACCTC 


A 

8300 


o 7\ oomo 7\ 7v mo 
vjALC I GAATG 


O TV TV O O /"»0 

GAAGCCGGCG 


GCACCTCGCT 


AACGGATTCA 


CCACTCCAAG 


8350 


TA TAmmOOAOOO 


tv tv mo tv tv mmom 

AATCAATTCT 


mo ooo iv o iv n ^ 

TGCGGAGAAC 


TGTGAATGCG 


CAAACCAACC 


8400 


ommo o o TV o TV tv 


o ta m ja moo ja mo 
CA I ATCCATC 


o oomooooo iv 

GCGTCCGCCA 


TCTCCAGCAG 


CCGCACGCGG 


8450 


oootv momooo 


oota ooommoo 


GTCCTGGCCA 


CGGGTGCGCA 


TGATCGTGCT 


n c? Art 

8500 


CCTGTCGTTG 


AGGACCCGGC 


TAGGCTGGCG 


GGGTTGCCTT 


ACTGGTTAGC 


8550 


AGAATGAATC 


ACCGATACGC 


GAGCGAACGT 


GAAGCGACTG 


CTGCTGCAAA 


8600 


ACGTCTGCGA 


CCTGAGCAAC 


AACATGAATG 


GTCTTCGGTT 


TCCGTGTTTC 


8650 


GTAAAGTCTG 


GAAACGCGGA 


AGTCAGCGCC 


CTGCACCATT 


ATGTTCCGGA 


8700 


TCTGCATCGC 


AGGATGCTGC 


TGGCTACCCT 


GTGGAACACC 


TACATCTGTA 


8750 


TTAACGAAGC 


CTTTCTCAAT 


GCTCACGCTG 


TAGGTATCTC 


AGTTCGGTGT 


8800 
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AGGTCGTTCG CTCCAAGCTG GGCTGTGTGC 
GACCGCTGCG CCTTATCCGG TAACTATCGT 
ACACGACTTA TCGCCACTGG CAGCAGCCAC 
CGAGGTATGT AGGCGGTGCT ACAGAGTTCT 
GGCTACACTA GAAGGACAGT ATTTGGTATC 
TACCTTCGGA AAAAGAGTTG GTAGCTCTTG 
CTGGTAGCGG TGGTTTTTTT GTTTGCAAGC 
AAAGGATCTC AAGAAGATCC TTTGATCTTT 
GTGGAACGAA AACTCACGTT AAGGGATTTT 
GGATCTTCAC CTAGATCCTT TTAAATTAAA 
TAAAGTATAT ATGAGTAAAC TTGGTCTGAC 
TGAGGCACCT ATCTCAGCGA TCTGTCTATT 
GACTCCCCGT CGTGTAGATA ACTACGATAC 
CCCAGTGCTG CAATGATACC GCGAGACCCA 
ATCAGCAATA AACCAGCCAG CCGGAAGGGC 
CAACTTTATC CGCCTCCATC CAGTCTATTA 
GTAAGTAGTT CGCCAGTTAA TAGTTTGCGC 
AGGCATCGTG GTGTCACGCT CGTCGTTTGG 
GTTCCCAACG ATCAAGGCGA GTTACATGAT 
GCGGTTAGCT CCTTCGGTCC TCCGATCGTT 
AGTGTTATCA CTCATGGTTA TGGCAGCACT 
TGCCATCCGT AAGATGCTTT TCTGTGACTG 
TTCTGAGAAT AGTGTATGCG GCGACCGAGT 
ACGGGATAAT ACCGCGCCAC ATAGCAGAAC 
GAAAACGTTC TTCGGGGCGA AAACTCTCAA 
TCCAGTTCGA TGTAACCCAC TCGTGCACCC 



ACGAACCCCC 


CGTTCAGCCC 


ooDU 


CTTGAGTCCA 


■» O m TV TV O 

ACCCGGTAAG 


Q Qfifi 


TGGTAACAGG 


ATTAGCAGAG 


QACO 

oyDu 


TGAAiaTGGTG 


GCCTAACTAC 


y uuu 


TGCGCTCTGC 


mo tv » o oo tv om 
TGAAG CCAGT 


QACA 


ATCCGGCAAA 


o * * » oo tv ooo 

CAAACCACCG 


yiuu 


AGCAGATTAC 


^ /"^ "* O TV TV TV TV 

GCG C AGAAAA 


qi cn 


TCTACGGGGT 


^fTlO TV OO / II IIOTV 

CTGACGCTCA 


y/uu 


A*OmO % TT\f* IV ^* TV 

GGTCATGAGA 


mm tv mo tv 7v tv tv tv 

TTAa caaaaa 




» % ^nf* * % Mill im 

AATGAAGTTT 


nix tv tv mo 7v tv rno 
TAAATCAA1 c 




n. o mm *k o r% tv x m 

AGTTACCAAT 


oornrniv tv mo TV O 
GCI 1AAICAG 


y J3u 


TCGTTCATCC 


TV m tv omrpo pprn 
Al AG1 XbULl 




GGGAGGGCTT 




y*t3y 


CGCTCACCGG 


CTCCAGATT I 


youu 


CGAGCGCAGA 


AGTGG a CL 1 G 




» mn^^tmm^^ ^^^^^^ 

ATTGTTGCCG 


OO TV TV O OfTI TV O 7V 

GG AAG CI AG A 


q Ann 


AACGTTGTTG 


n mmo ^tmo o 

CCATTGCTGC 


yoou 


TATGGCTTCA 


TTCAGCTCCG 


q*7 nn 
y / uu 


CCCCCATGTT 


/■» mO O TV TV TV TV TV TV 

GTGCAAAAAA 


y / 3u 


6TCA6AAGTA 


AGTTGGCCGC 


youu 


GCATAATTCT 


CTTACTGTCA 


9850 


GTGAGTACTC 


AACCAAGTCA 


9900 


TGCTCTTGCC 


CGGCGTCAAC 


9950 


TTTAAAAGTG 


CTCATCATTG 


10000 


GGATCTTACC 


GCTGTTGAGA 


10050 


AACTGATCTT 


CAGCATCTTT 


10100 
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TACTTTCACC 


AGCGTTTCTG 


3GTGAGCAAA 


AACAGGAAGG 


CAAAATGCCG 


10150 


CAAAAAAGGG 


AATAAGGGCG 


ACACGGAAAT 


GTTGAATACT 


CATACTCTTC 


10200 


CTTTTTCAAT 


ATTATTGAAG 


CATTTATCAG 


GGTTATTGTC 


TCATGAGCGG 


10250 


ATACATATTT 


GAATGTATTT 


AGAAAAATAA 


ACAAATAGGG 


GTTCCGCGCA 


10300 


CATTTCCCCG 


AAAAGTGCCA 


CCTGACGTCT 


AAGAAACCAT 


TATTATCATG 


10350 


ACATTAACCT 


ATAAAAATAG 


GCGTATCACG 


AGGCCCTTTC 


GTCTTCAA 


10398 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4910 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



-TCGCGCGTTT 


CGGTGATGAC 


GGTGAAAACC 


TCTGACACAT 


GCAGCTCCCG 


50 


GAGACGGTCA 


CAGCTTGTCT 


GTAAGCGGAT 


GCCGGGAGCA 


GACAAGCCCG 


100 


TCAGGGCGCG 


TCAGCGGGTG 


TTGGCGGGTG 


TCGGGGCTGG 


CTTAACTATG 


150 


CGGCATCAGA 


GCAGATTGTA 


CTGAGAGTGC 


ACCATATGCG 


GTGTGAAATA 


200 


CCGCACAGAT 


GCGTAAGGAG 


AAAATACCGC 


ATCAGGCGCC 


ATTCGCCATT 


250 


CAGGCTGCGC 


AACTGTTGGG 


AAGGGCGATC 


GGTGCGGGCC 


TCTTCGCTAT 


300 


TACGCCAGCT 


GGCGAAAGGG 


GGATGTGCTG 


CAAGGCGATT 


AAGTTGGGTA 


350 


ACGCCAGGGT 


TTTCCCAGTC 


ACGACGTTGT 


AAAACGACGG 


CCAGTGCCAA 


400 


GCTTGCATGC 


CTGCAGGTCG 


ACTCTAGAGG 


ATCCGAAAAA 


ACCTCCCACA 


450 


CCTCCCCCTG 


AACCTGAAAC 


ATAAAATGAA 


TGCAATTGTT 


GTTGTTAACT 


500 


TGTTTATTGC 


AGCTTATAAT 


GGTTACAAAT 


AAAGCAATAG 


CATCACAAAT 


550 


TTCACAAATA 


AAG CATTTTT 


TTCACTGCAT 


TCTAGTTGTG 


GTTTGTCCAA 


600 


ACTCATCAAT 


GTATCTTATC 


ATGTCTGGAT 


CCCCGCGGCC 


GCCAAATCAT 


650 
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TTATTGTTCA AAGATGCAGT CATCCAAATC CACATTGACC AGATCGCAGG 700 

CAGTGCAAGC GTCTGGCACC TTTCCCATGA TATGATGAAT GTAGCACAGT 750 

TTCTGATACG CCTTTTTGAC GACAGAAACG GGTTGAGATT CTGACACGGG 800 

AAAGCACTCT AAACAGTCTT TCTGTCCGTG AGTG/JK3CAG ATATTTGAAT 850 

TCTGATTCAT TCTCTCGCAT TGTCTGCAGG GAAACAGCAT CAGATTCATG 900 

CCCACGTGAC GAGAACATTT GTTTTGGTAC CTGTCTGCGT AGTTGATCGA 950 

AGCTTCCGCG TCTGACGTCG ATGGCTGCGC AACTGACTCG CGCACCCGTT 1000 

TGGGCTCACT TATATCTGCG TCACTGGGGG CGGGTCTTTT CTTGGCTCCA 1050 

CCCTTTTTGA CGTAGAATTC ATGCTCCACC TCAACCACGT GATCCTTTGC 1100 

CCACCGGAAA AAGTCTTTGA CTTCCTGCTT GGTGACCTTC CCAAAGTCAT 1150 

GATCCAGACG GCGGGTGAGT TCAAATTTGA ACATCCGGTC TTGCAACGGC 1200 

TGCTGGTGTT CGAAGGTCGT TGAGTTCCCG TCAATCACGG CGCACATGTT 1250 

GGTGTTGGAG GTGACGATCA CGGGAGTCGG GTCTATCTGG GCCGAGGACT 1300 

TGCATTTCTG GTCCACGCGC ACCTTGCTTC CTCCGAGAAT GGCTTTGGCC 1350 

GACTCCACGA CCTTGGCGGT CATCTTCCCC TCCTCCCACC AGATCACCAT 1400 

CTTGTCGACA CAGTCGTTGA AGGGAAAGTT CTCATTGGTC CAGTTTACGC 1450 

ACCCGTAGAA GGGCACAGTG TGGGCTATGG CCTCCGCGAT GTTGGTCTTC 1500 

CCGGTAGTTG CAGGCCCAAA CAGCCAGATG GTGTTCCTCT TGCCGAACTT 1550 

TTTCGTGGCC CATCCCAGAA AGACGGAAGC CGCATATTGG GGATCGTACC 1600 

CGTTTAGTTC CAAAATTTTA TAAATCCGAT TGCTGGAAAT GTCCTCCACG 1650 

GGCTGCTGGC CCACCAGGTA GTCGGGGGCG GTTTTAGTCA GGCTCATAAT 1700 

CTTTCCCGCA TTGTCCAAGG CAGCCTTGAT TTGGGACCGC GAGTTGGAGG 1750 

CCGCATTGAA GGAGATGTAT GAGGCCTGGT CCTCCTGGAT CCACTGCTTC 1800 

TCCGAGGTAA TCCCCTTGTC CACGAGCCAC CCGACCAGCT CCATGTACCT 1850 

GGCTGAAGTT TTTGATCTGA TCACCGGCGC ATCAGAATTG GGATTCTGAT 1900 

TCTCTTTGTT CTGCTCCTGC GTCTGCGACA CGTGCGTCAG ATGCTGCGCC 1950 
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ACCAACCGTT 

*B w ^B» ^BrB> «B> A ^B» ^BT ^bv 


TACGCTCCGT 


-CAPrATTPA 71 71 


. wAGGCGCTTA 


. AATACTGTTC 


^Bk ^Bh. ^Bk 

2000 


CATATTAGTC 

^B ^» ^^B T 4BV ^B O BaB ^BBW 


CACGCCCACT 


ww** ww X wAwVl 


w X X 1 T X G 


GGGAGCAAGT 


BV*A ^B> AW 

2050 


AATTGGGGAT 

• • •»» ^bt ^* ^b*4 4 


GTAGCACTCA 


TPPAPPAPPT 1 

X w wA^* wAw w x 


lull wwCGCC 


TCCGGCGCCA 


2100 


TTTCTGGTCT 


TTGTGAPPGP 


Ct A A PP A pfpmm 


GGCaAAGTCG 


GCTCGATCCC 


2150 


GCGGTAAATT 


CTCTGAATPA 

w x w x uxui x 


UlllXl w w 1 wVJ 


AATCTGACTC 


AGGAAACGTC 


2200 


CCAAAACCAT 


GG A m TTP A PP 

VJwfl X x x wA w w 


opoo mo omrprn 


CCACGAGCAC 


GTGCATGTGG 


2250 


AAGTAGCTCT 


pmppommpmp 


AAnl iwLAUA 


AAGAAAAGGG 


CCTCCGGGGC 


2300 


CTTACTCACA 

w x x fx w x ^nvxi 


ww'www'wwAX x 


w Uvj x wAvjAAA 


GTCGCGCTGC 


AGCTTCTCGG 


2350 


CCACGGTCAG 


GGGTGCCTGC 

www X WW w X WW 


TP A A 'TP 2V P a *P 
X Caa X UAuA X 


I CAGATCCAT 


GTCAGAATCT 


2400 


GGCGGCAACT 

w ^bt W ^bv ^BT4 44 4*^^ ^B> 


CCCATTCCTT 

vn x x x x 


w X wUO w wAw w 


t^AG X TCACAA 


AGCTGTCAGA 


^Bl A b«B> * 

2450 


AATGCCGGGC 


AGATGCCCGT 


CAACC f TPnP r r 

wXUlUU X www x 


bwbbALt X X A 


ATCACAATCT 


4*X ^™ A^B. ^B> 

2500 


CGTAAAACCC 


CGGCATGGCG 


GCTfiPCiPnTT 

ww X w w w w w X X 


w AAA ww X UC C 


GCTTCAAAAT 


B«Bl ^bv ^B» ^Bv 

2550 


GGAGACCCTG 


CGTGCTCACT 


CGGGCWIk A A 
wVjwvj w X inxin 


T* a OOOTA OO/rn 
XAwwwAGwGx 


GACCACATGG 


4*% ^^B* *f»4 

2600 


TGTCGCAAAA 

^b» ^b» ^B» ^B^ ^b»i ^B»** «a» «b 44V 4- 


TGTCGCAAAA 


wr*w x UAUu X w 


A ww X C X AATA 


CAGGACTCTA 


2650 


GAGGATCCCC 


GGGTACCGAG 


PTPGAATTPfl 
w x vVJnAl X ww* 


rp 7A ta mo n moom 
X AA X CA I GGT 


CATAGCTGTT 


4*% ■ ^B> A 

2700 


TCCTGTGTGA 


AATTGTTATP 

X X w X X X\X w 


PCPTP21P1 TV rn 


TCCACACAAC 


ATACGAGCCG 


2750 


GAAGCATAAA 


GTGTAAAP.PP 

w x \j x nruiu w w 


X vjoVjvj X uLL X 


AATGAGTGAG 


CTAACTCACA 


2800 


TTAATTGCGT 


TGCGCTCAPT 


GPPP^PTTT^ 
w w w wVjw X X X w 


CAGTCGGGAA 


ACCTGTCGTG 


2850 


CCAGCTGCAT 


TAATGAATPC 

x nn x uaa x v»w 


VsLUiAUuLbU 


GGGGAGAGGC 


GGTTTGCGTA 


^Bk J ^B* ^Bk ^Bk 

2900 


TTGGGCGCTC 


TTPPGPTTPP 

X X ww ww X X ww 


TPfiPTPAPTP 
X vuL X L*Aw X Lj 


7A omooom^i^^ 

ACTCGCTGCG 


CTCGGTCGTT 


2950 


CGGCTGC66C 


GAGCGGTATC 


AGCTCACTCA 


AAGGCGGTAA 


TACGGTTATC 


3000 


CACAGAATCA 


GGGGATAACG 


CAGGAAAGAA 


CATGTGAGCA 


AAAGGCCAGC 


3050 


AAAAGGCCAG 


GAACCGTAAA 


AAGGCCGCGT 


TGCTGGCGTT 


TTTCCATAGG 


3100 


CTCCGCCCCC 


CTGACGAGCA 


TCACAAAAAT 


CGACGCTCAA 


GTCAGAGGTG 


3150 


GCGAAACCCG 


ACAGGACTAT 


AAAGATACCA 


GGCGTTTCCC 


CCTGGAAGCT 


3200 


CCCTCGTGCG 


CTCTCCTGTT 


CCGACCCTGC 


CGCTTACCGG , 


ATACCTGTCC 


3250 
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GCCTTTCTCC 


r*r%yt\\ r* A A P 
Cx X CG GG AAb 


LulubUuv^l X 


TPTPATAflPT 


CACGCTGTAG 


3300 


GTATCTCAGT 


TCGG IbX AGG 




pa jLrznTriczcic 


TGTGTGPAPG 

X w X w X w Wlvw 


3350 


AACCCCCCGT 




\m+\J\~ X VjV-»Vj W W X 


X AX v^VvVJVJ Xt\r\ 


CTATCGTCTT 

\m+ X X V*»\J X W X X 


3400 


0m -m rn ^% ^% n tv ^\ 

GAGTCCAACC 


CGG 1 AAG AC A 


Lunv« X X A X wo 


ppApS nrtPAC 


PAGCCACTGG 


3450 


TAACAGGATT 


tv O A j*" A P* /"»/"» 7A 
AG C AGAG GGA 






GAGTTCTTGA 

wftw X X w X X VJX* 


3500 


AGTGGTGGCC 


T AACTACGG G 


X AC AC X AVjAri 


n^ZAPA^tTATT 
ubnv«AVJlnx X 


TGGTATCTGC 

x ww x n x >0 x w w 


3550 


GCTCTGCTGA 


AGCCAGTTAC 


PTTPPP A A A A 


A n A ^TT^ItT A 


GPTPTTGATP 

VJvlVv X X wAx w 


3600 


CGGCAAACAA 


^ 0m 0m m\ 0m 0m 0m ^%^nf* 

ACCACCGCTG 


GTAGCGG X CjU 


M if in in in l ii l ii l i/^'l'ip 
X X X X X X Ibl X 


m/^p a AGP AGP 
X uv^AAwWivju 


3650 


AGATTACGCG 


CAGAAAAAAA 


GG A X c X LAAb 


A A P 1 A rp/T ,r P r P r P 
AAGA X CC XXX 


A XV— XX llv*l 


3700 


ACGGGGTCTG 


ACGCTCAGTG 


A A A A A A 

CjAALvjAAAAC 


•PPaPPTTl An 
X CACl? X X t\r\\J 


GG A TTTTGGT 
uun X X X X ww x 


3750 


CATGAGATTA 


ni/iK TV TV TV TV f*f* TA 

T C AAAAAGGA 


X C X X LALV- X A 


bnl CCX X X X A 


AATTAAAAAT 

AAX X AAAAAX 


3800 


GAAGTTTTAA 


tv rno tv tv m/" , n"> A A 
Ax CAAX CXAA 


Ablnlnliilu 


ACTA A APTTC 


GTPTGAPAGT 

\j X w x \jn ^>x*vj x 


3850 


TACCAATGCT 


m a A n"ip» Jirmr 1 A 






GTCTATTTCG 


3900 


^nm tv m iv m tv 

TTCATCCATA 


G X 1 G C G X uAL 


X ^»^»\M*^urX wwX 


fZTAGAT'AACT 


ACGATACGGG 


3950 


AGGGCTT AC C 


A1L1 GG CCCC 


nb X bV^ X url^/Vn 


t 

y ■■* 

TGATAPCGCG 


AGACCCACGC 


4000 


TCACCGGCTC 


/■» X A (|H It'll a n^p* 

CAGAX X J.A1L 


AbLAA X aa/Iw 


PAGPPAGPPG 

\m»Aw Vm* V>Aw V»»ww 


GAAGGGCCGA 


4050 


GCGCAGAAGT 


GGTCCTGCAA 


nmmm a rnf^CT* C* 
UX X lAltUbL 


PTPPlTPPAt^ 


TPTATTAATT 

X w X A X X AAX X 


4100 


GTTGCCGGGA 


■m ^ fyrry tv o tv /*• m A 

AGCTAGAGTA 


AG X Avj X X LbL 


CAVjX XAAlAb 


TTTGPGPAAC 

XXX uvuvATlv 


4150 


GTTGTTGCCA 


mm^ TV ^V TV O 

TTGCTACAGG 


O A rno/^fpi r, P ,r rP' 


m P A PP P m P*^T 
X CACVjC X tu X 


PGTTTGGTAT 

ww XXX wVJ X f% X 


4200 


GGCTTCATTC 


AGCTCCGGTT 


CCCAACGAX C 


AAubtuAu X X 


APATGATCCC 
nvn x wA x www 


4250 


CCATGTTGTG 


CAAAAAAGCG 


GTTAGCTCCT 


TCGGTCCTCC 


GATCGTTGTC 


^ ^% ^% 

4300 


AGAAGTAAGT 


TGGCCGCAGT 


GTTATCACTC 


ATGGTTATGG 


CAGCACTGCA 


4350 


TAATTCTCTT 


ACTGTCATGC 


CATCCGTAAG 


ATGCTTTTCT 


GTGACTGGTG 


4400 


AGTACTCAAC 


CAAGTCATTC 


TGAGAATAGT 


GTATGCGGCG 


ACCGAGTTGC 


4450 


TCTTGCCCGG 


CGTCAATACG 


GGATAATACC 


GCGCCACATA 


GCAGAACTTT 


4500 


AAAAGTGCTC 


ATCATTGGAA 


AACGTTCTTC 


GGGGCGAAAA 


CTCTCAAGGA 


4550 
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TCTTACCGCT GTTGAGATCC 
TGATCTTCAG CATCTTTTAC 
AGGAAGGCAA AATGCCGCAA 
GAATACTCAT ACTCTTCCTT 
TATTGTCTCA TGAGCGGATA 
AATAGGGGTT CCGCGCACAT 
AAACCATTAT TATCATGACA 
CCCTTTCGTC 



61 

AGTTCGATGT AACCCACTCG 
TTTCACCAGC GTTTCTGGGT 
AAAAGGGAAT AAGGGCGACA 
TTTCAATATT ATTGAAGCAT 
CATATTTGAA TGTATTTAGA 
TTCCCCGAAA AGTGCCACCT 
TTAACCTATA AAAATAGGCG 



TGCACCCAAC 4600 

GAGCAAAAAC 4650 

CGGAAATGTT 4700 

TTATCAGGGT 4750 

AAAATAAACA 4800 

GACGTCTAAG 4850 

TATCACGAGG 4900 

4910 
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WHAT IS CLAIMED IS; 

1. A recombinant hybrid virus comprising: 

(a) DNA sequences of, or corresponding 
to, the 5 9 inverted terminal repeat (ITR) sequences of an 
adenovirus and the 5' adenovirus packaging/ enhancer 
domain; 

(b) DNA sequences of, or corresponding 
to, the 5' adeno-associated virus (AAV) ITR sequences; 

(c) a gene encoding a selected protein 
operatively linked to regulatory sequences directing 
expression of the protein in a target cell in vivo or in 
vitro; 

(d) DNA sequences of, or corresponding 

to, the 3 1 AAV ITR sequences; 

(e) DNA sequences of, or corresponding 

to, the 3 f adenovirus ITR sequences; 

wherein said virus is replication- 
defective and is provided with a sufficient portion of 
the genome of the adenovirus to permit infection of the 
target cell. 

2. The virus according to claim 1 wherein 
said adenovirus is rendered replication defective by a 
deletion in all or a part of the El gene. 



3. The virus according to claim 2 wherein 
said adenovirus genome has a deletion in all or a part 
the E3 gene. 
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4. The virus according to claim 1 wherein 
said adenovirus genome comprising deletions in the DNA 
sequences of all or a portion of the adenovirus genes 
selected from the group consisting of the E2a gene, the 
E4 gene, the late genes LI through L5, the intermediate 
genes IX and IV a , and a combination thereof. 

5. The virus according to claim 1 wherein 
said selected gene is a reporter gene. 

6. The virus according to claim 5 wherein 
said reporter gene is selected from the group consisting 
of the genes encoding 6-galactosidase, alkaline 
phosphatase and green fluorescent protein. 

7. The virus according to claim 1 wherein 
said selected gene is a therapeutic gene. 

8. The virus according to claim 7 wherein 
said therapeutic gene is selected from the group 
consisting of a normal CFTR gene and a normal LDL gene. 

9. The virus according to claim 1 further 
comprising the DNA of, or corresponding to, a functional 
portion of the genome of an adeno-associated virus rep 
gene. 

10. A recombinant hybrid vector comprising: 

(a) DNA sequences of, or corresponding 
to, the 5' inverted terminal repeat (ITR) sequ nces of an 
adenovirus and the 5 f adenovirus packaging/ enhancer 
domain; 

(b) DNA sequences of, or corresponding 
to, the 5' adeno-associated virus (AAV) ITR sequences; 
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(c) a gene encoding a selected protein 
operatively linked to regulatory sequences directing 
expression of the protein in a target cell in vivo or in 
vitro; 

(d) DNA sequences of, or corresponding 
to , the 3 • AAV ITR sequences ; 

(e) DNA sequences of, or corresponding 
to, the 3' adenovirus ITR sequences; and 

(d) plasmid DNA sequences containing 
regulatory elements. 

11. A recombinant trans-infection particle 

comprising: 

(a) a recombinant hybrid virus comprising: 

(i) DNA sequences of, or corresponding 
to, the 5 1 inverted terminal repeat (ITR) sequences of an 
adenovirus and the 5' adenovirus packaging/ enhancer 
domain; 

(ii) DNA sequences of, or corresponding 
to, the 5' adeno-associated virus (AAV) ITR sequences; 

(iii) a gene encoding a selected protein 
operatively linked to regulatory sequences directing 
expression of the protein in a target cell in vivo or in 
vitro; 

(iv) DNA sequences of, or corresponding 
to, the 3' AAV ITR sequences; 

(v) DNA sequences of, or corresponding 
to , the 3 ' adenovirus ITR sequences ; 

wherein said virus is replication- 
def ctive and is provided with a sufficient portion of 
the genome of the adenovirus to permit infection of the 

target cell; 

(b) a polycation sequence conjugated to said 

hybrid virus; and 
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(c) a plasmid comprising an AAV rep gene 
operatively linked to regulatory sequences capable of 
directing its expression, said plasmid associated with 
said polycation sequence. 

12. The trans-infection particle according to 
claim 11 wherein said adenovirus DNA lacks the sequence 
encoding viral genes. 

13. The trans-infection particle according to 
claim 11 wherein said adenovirus genome is rendered 
replication-defective by a deletion in all or a part of 
the El gene. 

14. The particle according to claim 13 wherein 
said adenovirus genome has a deletion in all or a part of 
the E3 gene. 

15. The particle according to claim 11 wherein 
said adenovirus genome has deletions in the DNA sequences 
of all or a portion of the adenovirus genes selected from 
the group consisting of the E2a gene, the E4 gene, the 
late genes LI through L5, the intermediate genes IX and 
IV a , and a combination thereof. 

16. The particle according to claim 11 wherein 
said selected gene is a reporter gene. 

17. The particle according to claim 16 wherein 
said reporter gene is selected from the group consisting 
of the genes encoding B-galactosidase, alkaline 
phosphatase and green fluorescent protein. 

18. The particle according to claim 11 wherein 
said selected gene is a th rapeutic gene. 
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19. The particle according to claim 18 wherein 
said therapeutic gene is selected from the group 
consisting of a normal CFTR gene and a normal LDL gene. 

20. A composition for use in delivering and 
stably integrating a selected gene into the chromosome of 
a target cell, said composition comprising 

(a) a recombinant hybrid virus 

comprising: 

(i) DNA sequences of, or 
corresponding to, the 5' inverted terminal repeat (ITR) 
sequences of an adenovirus and the 5 1 adenovirus 
packaging/ enhancer domain; 

(ii) DNA sequences of , or 
corresponding to, the 5' adeno-associated virus (AAV) ITR 



(iii) a gene encoding a selected 
protein operatively linked to regulatory sequences 
directing expression of the protein in a target cell in 
vivo or in vitro; 

(iv) DNA sequences of, or 
corresponding to, the 3' AAV ITR sequences; 

(v) DNA sequences of, or 
corresponding to, the 3' adenovirus ITR sequences; 

wherein said virus is replication- 
defective and is provided with a sufficient portion of 
the genome of the adenovirus to permit infection of the 

target cell; and 

(b) a pharmaceutically acceptable 



21. The composition according to claim 20 
further comprising a plasmid comprising an AAV rep gene 
under the control of regulatory sequences capable of 
expressing said rep gene in said target cell. 
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22. The composition according to claim 21 
wherein said vector further comprises the DNA of, or 
corresponding to, at least a functional portion of the 
genome of an adeno-associated virus rep gene. 

23. A composition for use in delivering and 
stably integrating a selected gene into the chromosome of 
a target cell comprising an effective amount of a 
recombinant trans-infection particle in a 
pharmaceutical ly acceptable carrier, said particle 
comprising: 

(a) a recombinant hybrid virus comprising: 

(i) DNA sequences of, or corresponding 
to, the 5 1 inverted terminal repeat (ITR) sequences of an 
adenovirus and the 5' adenovirus packaging/ enhancer 
domain; 

(ii) DNA sequences of, or corresponding 
to, the 5 1 adeno-associated virus (AAV) ITR sequences; 

(iii) a gene encoding a selected protein 
operatively linked to regulatory sequences directing 
expression of the protein in a target cell in vivo or in 
vi tro ; 

(iv) DNA sequences of, or corresponding 
to, the 3' AAV ITR sequences; 

(v) DNA sequences of, or corresponding 
to , the 3 1 adenovirus ITR sequences ; 

wherein said virus is replication- 
defective and is provided with a sufficient portion of 
the genome of the adenovirus to permit infection of the 
target cell; 

(b) a polycation sequence conjugated to said 
hybrid virus; 
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(c) a plasmid comprising an AAV rep gene 
operatively linked to regulatory sequences capable of 
directing its expression, said plasmid associated with 
said polycation sequence. 

24. A mammalian cell capable of expressing a 
selected gene introduced therein through transduction of 
the virus of claim 1, the vector of claim 10 , or the 
trans-infection particle of claim 11. 

25. A method for producing high levels of a 
recombinant adeno-associated virus comprising the steps 

of 

(a) culturing a cell co-transf ected with 
the vector of claim 10 and an optional helper virus in 
the presence of a plasmid containing an AAV rep gene 
under the control of regulatory sequences capable of 
expressing said rep gene; and 

(b) isolating fpom said culture a 

recombinant AAV. 

26. A method for producing high levels of a 
recombinant adeno-associated virus comprising the steps 

of 

(a) culturing a cell transfected with the 

particle of claim 11 and 

(b) isolating from said culture a 

recombinant AAV. 
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GAATTCGCTA GCATCATCAA 
GGGGGTGGAG TTTGTGACGT 
GGCGGAAGTG TGATGTTGCA 
TGACGTTTTT GGTGTGCGCC 
GGATGTTGTA GTAAATTTGG 
GAATAAGAGG AAGTGAAATC 
AGGGAGATCT GCTGCGCGCT 
GGGCGACCTT TGGTCGCCCG 
ACTCCATCAC TAGGGGTTCC 
TTCGAGCTTG CATGCCTGCA 
CCGCCCAACG ACCCCCGCCC 
ATAGGGACTT TCCATTGACG 
GTACATCAAG TGTATCATAT 
CCCGCCTGGC ATTATGCCCA 
TACGTATTAG TCATCGCTAT 
GGATAGCGGT TTGACTCACG 
TTGTTTTGGC ACCAAAATCA 
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FIGURE 2A 

TAATATACCT TATTTTGGAT 
GGCGCGGGGC GTGGGAACGG 
AGTGTGGCGG AACACATGTA 
GGTGTACACA GGAAGTGACA 
GCGTAACCGA GTAAGATTTG 
TGAATAATTT TGTGTTACTC 
CGCTCGCTCA CTGAGGCCGC 
GCCTCAGTGA GCGAGCGAGC 
TTGTAGTTAA TGATTAACCC 
GGTCGTTACA TA^CTTACGG 
ATTGACGTCA ATAATGACGT 
TCAATGGGTG GAGTATTTAC 
GCCAAGTACG CCCCCTATTG 
GTACATGACC TTATGGGACT 
TACCATGGTG ATGCGGTTTT 
GGGATTTCCA AGTCTCCACC 
ACGGGACTTT CCAAAATGTC 
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TGAAGCCAAT ATGATAATGA 

60 



GGCGGGTGAC GTAGTAGTGT 

120 



AGCGACGGAT GTGGCAAAAG 

180 

ATTTTCGCGC GGTTTTAGGC 

240 

GCCATTTTCG CGGGAAAACT 

300 

ATAGCGCGTA ATATTTGTCT 

360 



CCGGGCAAAG CCCGGGCGTC 

420 



GCGCAGAGAG GGAGTGGCCA 

480 



GCCATGCTAC TTATCTACAA 

540 



TAAATGGCCC GCCTGGCTGA 

600 



ATGTTCCCAT AGTAACGCCA 

660 



GGTAAACTGC CCACTTGGCA 

720 



ACGTCAATGA CGGTAAATGG 

780 



TTCCTACTTG GCAGTACATC 

840 



GGCAGTACAT CAATGGGCGT 

900 



CCATTGACGT CAATGGGAGT 

960 



GTAACAACTC CGCCCCATTG 

1020 
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ACGCAAATGG GCGGTAGGCG 
AACCGTCAGA TCGCCTGGAG 
GACCGATCCA GCCTCCGGAC 
GTTAACTGGT AAGTTTAGTC 
CAAATCAAAG AACTGCTCCT 
TTACTTCTGC TCTAAAAGCT 
GAGCCTGCTA AAGCAAAAAA 
TTTTCGTTGC CGGTCTGGGA 
ATCCCGTCGT TTTACAACGT 
TTGCAGCACA TCCCCCTTTC 
CTTCCCAACA GTTGCGCAGC 
AAGCGGTGCC GGAAAGCTGG 
CCTCAAACTG GCAGATGCAC 
TTACGGTCAA TCCGCCGTTT 
TTAATGTTGA TGAAAGCTGG 
ACTCGGCGTT TCATCTGTGG 
TGCCGTCTGA ATTTGACCTG 
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FIGURE 2B 

TGTACGGTGG GAGGTCTATA 

ACGCCATCCA CGCTGTTTTG 

TCTAGAGGAT CCGGTACTCG 

TTTTTGTCTT TTATTTCAGG 

CAGTGGATGT TGCCTTTACT 

GCGGAATTGT ACCCGCGGCC 

GAAGTCACCA TGTCGTTTAC 

GGCATTGGTC TGGACACCAG 

CGTGACTGGG AAAACCCTGG 

GCCAGCTGGC GTAATAGCGA 

CTGAATGGCG AATGGCGCTT 

CTGGAGTGCG ATCTTCCTGA 

GGTTACGATG CGCCCATCTA 

GTTCCCACGG AGAATCCGAC 

CTACAGGAAG GCCAGACGCG 

TGCAACGGGC GCTGGGTCGG 

AGCGCATTTT TACGCGCCGG 
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TAAGCAGAGC TCGTTTAGTG 

1080 

ACCTCCATAG AAGACACCGG 

1140 

AGGAACTGAA AAACCAGAAA 

1200 

TCCCGGATCC GGTGGTGGTG 

1260 

TCTAGGCCTG TACGGAAGTG 

1320 

GCAATTCCCG GGGATCGAAA 

1380 

TTTGACCAAC AAGAACGTGA 

1440 

CAAGGAGCTG CTCAAGCGCG 

1500 

CGTTACCCAA CTTAATCGCC 

1560 



AGAGGCCCGC ACCGATCGCC 

1620 

TGCCTGGTTT CCGGCACCAG 

1680 

GGCCGATACT GTCGTCGTCC 

1740 



CACCAACGTA ACCTATCCCA 

1800 

GGGTTGTTAC TCGCTCACAT 

1860 

AATTATTTTT GATGGCGTTA 

1920 



TTACGGCCAG GACAGTCGTT 

1980 



AGAAAACCGC CTCGCGGTGA 

2040 



) 
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FIGURE 2C 



TGGTGCTGCG TTGGAGTGAC GGCAGTTATC TGGAAGATCA GGATATGTGG CGGATGAGCG 

2100 



GCATTTTCCG TGACGTCTCG TTGCTGCATA AACCGACTAC ACAAATCAGC GATTTCCATG 

2160 



TTGCCACTCG CTTTAATGAT GATTTCAGCC GCGCTGTACT GGAGGCTGAA GTTCAGATGT 

2220 



GCGGCGAGTT GCGTGACTAC CTACGGGTAA CAGTTTCTTT ATGGCAGGGT GAAACGCAGG 

2280 



TCGCCAGCGG CACCGCGCCT TTCGGCGGTG AAATTATCGA TGAGCGTGGT GGTTATGCCG 

2340 



ATCGCGTCAC ACTACGTCTG AACGTCGAAA ACCCGAAACT GTGGAGCGCC GAAATCCCGA 

2400 



ATCTCTATCG TGCGGTGGTT GAACTGCACA CCGCCGACGG CACGCTGATT GAAGCAGAAG 

2460 



CCTGCGATGT CGGTTTCCGC GAGGTGCGGA TTGAAAATGG TCTGCTGCTG CTGAACGGCA 

2520 



AGCCGTTGCT GATTCGAGGC GTTAACCGTC ACGAGCATCA TCCTCTGCAT GGTCAGGTCA 

2580 



TGGATGAGCA GACGATGGTG CAGGATATCC TGCTGATGAA GCAGAACAAC TTTAACGCCG 

2640 



TGCGCTGTTC GCATTATCCG AACCATCCGC TGTGGTACAC GCTGTGCGAC CGCTACGGCC 

2700 



TGTATGTGGT GGATGAAGCC AATATTGAAA CCCACGGCAT GGTGCCAATG AATCGTCTGA 

2760 



CCGATGATCC GCGCTGGCTA CCGGCGATGA GCGAACGCGT AACGCGAATG GTGCAGCGCG 

2820 



ATCGTAATCA CCCGAGTGTG ATCATCTGGT CGCTGGGGAA TGAATCAGGC CACGGCGCTA 

2880 



ATCACGACGC GCTGTATCGC TGGATCAAAT CTGTCGATCC TTCCCGCCCG GTGCAGTATG 

2940 



AAGGCGGCGG AGCCGACACC ACGGCCACCG ATATTATTTG CCCGATGTAC GCGCGCGTGG 



ATGAAGACCA GCCCTTCCCG GCTGTGCCGA AATGGTCCAT CAAAAAATGG CTTTCGCTAC 
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CTGGAGAGAC GCGCCCGCTG 
GCGGTTTCGC TAAATACTGG 
TCTGGGACTG GGTGGATCAG 
CTTACGGCGG TGATTTTGGC 
TCTTTGCCGA CCGCACGCCG 
TCCAGTTCCG TTTATCCGGG 
GCGATAACGA GCTCCTGCAC 
AAGTGCCTCT GGATGTCGCT 
AGCCGGAGAG CGCCGGGCAA 
CATGGTCAGA AGCCGGGCAC 
GTGTGACGCT CCCCGCCGCG 
TTTGCATCGA GCTGGGTAAT 
AGATGTGGAT TGGCGATAAA 
CACCGCTGGA TAACGACATT 
TCGAACGCTG GAAGGCGGCG 
CAGATACACT TGCTGATGCG 
AAACCTTATT TATCAGCCGG 
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FIGURE 2D 

ATCCTTTGCG AATACGCCCA 
CAGGCGTTTC GTCAGTATCC 
TCGCTGATTA AATATGATGA 
GATACGCCGA ACGATCGCCA 
CATCCAGCGC TGACGGAAGC 
CAAACCATCG AAGTGACCAG 
TGGATGGTGG CGCTGGATGG 
CCACAAGGTA AACAGTTGAT 
CTCTGGCTCA CAGTACGCGT 
ATCAGCGCCT GGCAGCAGTG 
TCCCACGCCA TCCCGCATCT 
AAGCGTTGGC AATTTAACCG 
AAACAACTGC TGACGCCGCT 
GGCGTAAGTG AAGCGACCCG 
GGCCATTACC AGGCCGAAGC 
GTGCTGATTA CGACCGCTCA 
AAAACCTACC GGATTGATGG 
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CGCGATGGGT AACAGTCTTG 

3120 

CCGTTTACAG GGCGGCTTCG 

3180 



AAACGGCAAC CCGTGGTCGG 

3240 

GTTCTGTATG AACGGTCTGG 

3300 

AAAACACCAG CAGCAGTTTT 

3360 

CGAATACCTG TTCCGTCATA 

3420 

TAAGCCGCTG GCAAGCGGTG 

3480 



TGAACTGCCT GAACTACCGC 

3540 



AGTGCAACCG AACGCGACCG 

3600 

GCGTCTGGCG GAAAACCTCA 

3660 

GACCACCAGC GAAATGGATT 

3720 



CCAGTCAGGC TTTCTTTCAC 

3780 



GCGCGATCAG TTCACCCGTG 

3840 



CATTGACCCT AACGCCTGGG 

3900 



AGCGTTGTTG CAGTGCACGG 

3960 



CGCGTGGCAG CATCAGGGGA 

4020 



TAGTGGTCAA ATGGCGATTA 

4080 
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FIGURE 2E 



CCGTTGATGT TGAAGTGGCG AGCGATACAC CGCATCCGGC GCGGATTGGC CTGAACTGCC 

4140 



AGCTGGCGCA GGTAGCAGAG CGGGTAAACT GGCTCGGATT AGGGCCGCAA GAAAACTATC 

4200 



CCGACCGCCT TACTGCCGCC TGTTTTGACC GCTGGGATCT GCCATTGTCA GACATGTATA 

4260 



CCCCGTACGT CTTCCCGAGC GAAAACGGTC TGCGCTGCGG GACGCGCGAA TTGAATTATG 

4320 



GCCCACACCA GTGGCGCGGC GACTTCCAGT TCAACATCAG CCGCTACAGT CAACAGCAAC 

4380 



TGATGGAAAC CAGCCATCGC CATCTGCTGC ACGCGGAAGA AGGCACATGG CTGAATATCG 

4440 



ACGGTTTCCA TATGGGGATT GGTGGCGACG ACTCCTGGAG CCCGTCAGTA TCGGCGGAAT 

4500 



TACAGCTGAG CGCCGGTCGC TACCATTACC AGTTGGTCTG GTGTCAAAAA TAATAATAAC 

4560 



CGGGCAGGCC ATGTCTGCCC GTATTTCGCG TAAGGAAATC CATTATGTAC TATTTAAAAA 

4620 



ACACAAACTT TTGGATGTTC GGTTTATTCT TTTTCTTTTA CTTTTTTATC ATGGGAGCCT 

4680 



ACTTCCCGTT TTTCCCGATT TGGCTACATG ACATCAACCA TATCAGCAAA AGTGATACGG 

4740 



GTATTATTTT TGCCGCTATT TCTCTGTTCT CGCTATTATT CCAACCGCTG TTTGGTCTGC 

4800 



TTTCTGACAA ACTCGGCCTC GACTCTAGGC GGCCGCGGGG ATCCAGACAT GATAAGATAC 

4860 



ATTGATGAGT TTGGACAAAC CACAACTAGA ATGCAGTGAA AAAAATGCTT TATTTGTGAA 

4920 



ATTTGTGATG CTATTGCTTT ATTTGTAACC ATTATAAGCT GCAATAAACA AGTTAACAAC 

4980 



AACAATTGCA TTCATTTTAT GTTTCAGGTT CAGGGGGAGG TGTGGGAGGT TTTTTCGGAT 

5040 



CCTCTAGAGT CGAGTAGATA AGTAGCATGG CGGGTTAATC ATTAACTACA AGGAACCCCT 

5100 
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FIGURE 2F 



AGTGATGGAG TTGGCCACTC CCTCTCTGCG 
AAAGGTCGCC CGACGCCCGG GCTTTGCCCG 
CAGATCTGGA AGGTGCTGAG GTACGATGAG 
GGCGGTAAAC ATATTAGGAA CCAGCCTGTG 
GATCACTTGG TGCTGGCCTG CACCCGCGCT 
TGAGGTACTG AAATGTGTGG GCGTGGCTTA 
CTTATGTAGT TTTGTATCTG TTTTGCAGCA 
GATGGAAGCA TTGTGAGCTC ATATTTGACA 
CAGAATGTGA TGGGCTCCAG CATTGATGGT 
TTGACCTACG AGACCGTGTC TGGAACGCCG 
GCCGCTGCAG CCACCGCCCG CGGGATTGTG 
AGCAGTGCAG CTTCCCGTTC ATCCGCCCGC 
TTGGATTCTT TGACCCGGGA ACTTAATGTC 
CAGGTTTCTG CCCTGAAGGC TTCCTCCCCT 
CCAGACTCTG TTTGGATTTG GATCAAGCAA 
CGCGCGCGGT AGGCCCGGGA CCAGCGGTCT 
AGGACGTGGT AAAGGTGACT CTGGATGTTC 



CGCTCGCTCG CTCACTGAGG CCGGGCGACC 

5160 



GGCGGCCTCA GTGAGCGAGC GAGCGCGCAG 

5220 



ACCCGCACCA GGTGCAGACC CTGCGAGTGT 

5280 



ATGCTGGATG TGACCGAGGA GCTGAGGCCC 

5340 



GAGTTTGGCT CTAGCGATGA AGATACAGAT 

5400 



AGGGTGGGAA AGAATATATA AGGTGGGGGT 

5460 



GCCGCCGCCG CCATGAGCAC CAACTCGTTT 

5520 



ACGCGCATGC CCCCATGGGC CGGGGTGCGT 

5580 



CGCCCCGTCC TGCCCGCAAA CTCTACTACC 

5640 

TTGGAGACTG CAGCCTCCGC CGCCGCTTCA 

5700 



ACTGACTTTG CTTTCCTGAG CCCGCTTGCA 

5760 



GATGACAAGT TGACGGCTCT TTTGGCACAA 

5820 



GTTTCTCAGC AGCTGTTGGA TCTGCGCCAG 

5880 



CCCAATGCGG TTTAAAACAT AAATAAAAAA 

5940 



GTGTCTTGCT GTCTTTATTT AGGGGTTTTG 

6000 



CGGTCGTTGA GGGTCCTGTG TATTTTTTCC 

6060 



AGATACATGG GCATAAGCCC GTCTCTGGGG 

6120 
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TGGAGGTAGC ACCACTGCAG 
TAGCAGGAGC GCTGGGCGTG 
GGCAGGCCCT TGGTGTAAGT 
GATATGAGAT GCATCTTGGA 
CGGGGATTCA TGTTGTGCAG 
TCATGTAGCT TAGAAGGAAA 
TTTTCCATGC ATTCGTCCAT 
ATATTTCTGG GATCACTAAC 
TTTACAAAGC GCGGGCGGAG 
GCGTAGTTAC CCTCACAGAT 
TCTACCTGCG GGGCGATGAA 
AGCAGGTTCC TGAGCAGCTG 
ACCGGGTGCA ACTGGTAGTT 
ACTTCGTTAA GCATGTCCCT 
TCGCCGCCCA GCGATAGCAG 
TCCGCCGTAG GCATGCTTTT 
GTCACCTGCT CTACGGCATC 
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FIGURE 2G 
AGCTTCATGC TGCGGGGTGG 
GTGCCTAAAA ATGTCTTTCA 
GTTTACAAAG CGGTTAAGCT 
CTGTATTTTT AGGTTGGCTA 
AACCACCAGC ACAGTGTATC 
TGCGTGGAAG AACTTGGAGA 
AATGATGGCA ATGGGCCCAC 
GTCATAGTTG TGTTCCAGGA 
GGTGCCAGAC TGCGGTATAA 
TTGCATTTCC CACGCTTTGA 
GAAAACGGTT TCCGGGGTAG 
CGACTTACCG CAGCCGGTGG 
AAGAGAGCTG CAGCTGCCGT 
GACTCGCATG TTTTCCCTGA 
TTCTTGCAAG GAAGCAAAGT 
GAGCGTTTGA CCAAGCAGTT 
TCGATCCAGC ATATCTCCTC 
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TGTTGTAGAT GATCCAGTCG 

6180 

GTAGCAAGCT GATTGCCAGG 

6240 

GGGATGGGTG CATACGTGGG 

6300 

TGTTCCCAGC CATATCCCTC 

6360 

CGGTGCACTT GGGAAATTTG 

6420 

CGCCCTTGTG ACCTCCAAGA 

6480 

GGGCGGCGGC CTGGGCGAAG 

6540 

TGAGATCGTC ATAGGCCATT 

6600 

TGGTTCCATC CGGCCCAGGG 

6660 

GTTCAGATGG GGGGATCATG 

6720 



GGGAGATCAG CTGGGAAGAA 

6780 

GCCCGTAAAT CACACCTATT 

6840 

CATCCCTGAG CAGGGGGGCC 

6900 



CCAAATCCGC CAGAAGGCGC 

6960 

TTTTCAACGG TTTGAGACCG 

7020 

CCAGGCGGTC CCACAGCTCG 

7080 

GTTTCGCGGG TTGGGGCGGC 

7140 
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FIGURE 2H 



TTTCGCTGTA CGGCAGTAGT CGGTGCTCGT CCAGACGGGC CAGGGTCATG TCTTTCCACG 

7200 

GGCGCAGGGT CCTCGTCAGC GTAGTCTGGG TCACGGTGAA GGGGTGCGCT CCGGGCTGCG 

7260 

CGCTGGCCAG GGTGCGCTTG AGGCTGGTCC TGCTGGTGCT GAAGCGCTGC CGGTCTTCGC 

7320 

CCTGCGCGTC GGCCAGGTAG CATTTGACCA TGGTGTCATA GTCCAGCCCC TCCGCGGCGT 

7380 

GGCCCTTGGC GCGCAGCTTG CCCTTGGAGG AGGCGCCGCA CGAGGGGCAG TGCAGACTTT 

7440 

TGAGGGCGTA GAGCTTGGGC GCGAGAAATA CCGATTCCGG GGAGTAGGCA TCCGCGCCGC 

7500 

AGGCCCCGCA GACGGTCTCG CATTCCACGA GCCAGGTGAG CTCTGGCCGT TCGGGGTCAA 

7560 

AAACCAGGTT TCCCCCATGC TTTTTGATGC GTTTCTTACC TCTGGTTTCC ATGAGCCGGT 

7620 

GTCCACGCTC GGTGACGAAA AGGCTGTCCG TGTCCCCGTA TACAGACTTG AGAGGCCTGT 

7680 

CCTCGACCGA TGCCCTTGAG AGCCTTCAAC CCAGTCAGCT CCTTCCGGTG GGCGCGGGGC 

7740 

ATGACTATCG TCGCCGCACT TATGACTGTC TTCTTTATCA TGCAACTCGT AGGACAGGTG 

7800 

CCGGCAGCGC TCTGGGTCAT TTTCGGCGAG GACCGCTTTC GCTGGAGCGC GACGATGATC 

7860 

GGCCTGTCGC TTGCGGTATT CGGAATCTTG CACGCCCTCG CTCAAGCCTT CGTCACTGGT 

7920 



CCCGCCACCA AACGTTTCGG CGAGAAGCAG GCCATTATCG CCGGCATGGC GGCCGACGCG 

7980 



CTGGGCTACG TCTTGCTGGC GTTCGCGACG CGAGGCTGGA TGGCCTTCCC CATTATGATT 

8040 



CTTCTCGCTT CCGGCGGCAT CGGGATGCCC GCGTTGCAGG CCATGCTGTC CAGGCAGGTA 

8100 



GATGACGACC ATCAGGGACA GCTTCAAGGA TCGCTCGCGG CTCTTACCAG CCTAACTTCG 

8160 
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FIGURE 21 



ATCACTGGAC CGCTGATCGT CACGGCGATT TATGCCGCCT CGGCGAGCAC ATGGAACGGG 

8220 



TTGGCATGGA TTGTAGGCGC CGCCCTATAC CTTGTCTGCC TCCCCGCGTT GCGTCGCGGT 

8280 



GCATGGAGCC GGGCCACCTC GACCTGAATG GAAGCCGGCG GCACCTCGCT AACGGATTCA 

8340 



CCACTCCAAG AATTGGAGCC AATCAATTCT TGCGGAGAAC TGTGAATGCG CAAACCAACC 

8400 



CTTGGCAGAA CATATCCATC GCGTCCGCCA TCTCCAGCAG CCGCACGCGG CGCATCTCGG 

8460 



GCAGCGTTGG GTCCTGGCCA CGGGTGCGCA TGATCGTGCT CCTGTCGTTG AGGACCCGGC 

8520 



TAGGCTGGCG GGGTTGCCTT ACTGGTTAGC AGAATGAATC ACCGATACGC GAGCGAACGT 

8580 



GAAGCGACTG CTGCTGCAAA ACGTCTGCGA CCTGAGCAAC AACATGAATG GTCTTCGGTT 

8640 



TCCGTGTTTC GTAAAGTCTG GAAACGCGGA AGTCAGCGCC CTGCACCATT ATGTTCCGGA 

8700 

TCTGCATCGC AGGATGCTGC TGGCTACCCT GTGGAACACC TACATCTGTA TTAACGAAGC 

8760 



CTTTCTCAAT GCTCACGCTG TAGGTATCTC AGTTCGGTGT AGGTCGTTCG CTCCAAGCTG 

8820 



GGCTGTGTGC ACGAACCCCC CGTTCAGCCC GACCGCTGCG CCTTATCCGG TAACTATCGT 

8880 



CTTGAGTCCA ACCCGGTAAG ACACGACTTA TCGCCACTGG CAGCAGCCAC TGGTAACAGG 

8940 



ATTAGCAGAG CGAGGTATGT AGGCGGTGCT ACAGAGTTCT TGAAGTGGTG GCCTAACTAC 

9000 



GGCTACACTA GAAGGACAGT ATTTGGTATC TGCGCTCTGC TGAAGCCAGT TACCTTCGGA 

9060 



AAAAGAGTTG GTAGCTCTTG ATCCGGCAAA CAAACCACCG CTGGTAGCGG TGGTTTTTTT 

9120 



GTTTGCAAGC AGCAGATTAC GCGCAGAAAA AAAGGATCTC AAGAAGATCC TTTGATCTTT 

9180 
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TCTACGGGGT CTGACGCTCA 
TTATCAAAAA GGATCTTCAC 
TAAAGTATAT ATGAGTAAAC 
ATCTCAGCGA TCTGTCTATT 
ACTACGATAC GGGAGGGCTT 
CGCTCACCGG CTCCAGATTT 
AGTGGTCCTG CAACTTTATC 
GTAAGTAGTT CGCCAGTTAA 
GTGTCACGCT CGTCGTTTGG 
GTTACATGAT CCCCCATGTT 
GTCAGAAGTA AGTTGGCCGC 
CTTACTGTCA TGCCATCCGT 
TTCTGAGAAT AGTGTATGCG 
ACCGCGCCAC ATAGCAGAAC 
AAACTCTCAA GGATCTTACC 
AACTGATCTT CAGCATCTTT 
CAAAATGCCG CAAAAAAGGG 
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FIGURE 2J 
GTGGAACGAA AACTCACGTT 
CTAGATCCTT TTAAATTAAA 

i 

TTGGTCTGAC AGTTACCAAT 
TCGTTCATCC ATAGTTGCCT 
ACCATCTGGC CCCAGTGCTG 
ATCAGCAATA AACCAGCCAG 
CGCCTCCATC CAGTCTATTA 
TAGTTTGCGC AACGTTGTTG 
TATGGCTTCA TTCAGCTCCG 
GTGCAAAAAA GCGGTTAGCT 
AGTGTTATCA CTCATGGTTA 
AAGATGCTTT TCTGTGACTG 
GCGACCGAGT TGCTCTTGCC 
TTTAAAAGTG CTCATCATTG 
GCTGTTGAGA TCCAGTTCGA 
TACTTTCACC AGCGTTTCTG 
AATAAGGGCG ACACGGAAAT 
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AAGGGATTTT GGTCATGAGA 

9240 

AATGAAGTTT TAAATCAATC 

9300 

GCTTAATCAG TGAGGCACCT 

9360 

GACTCCCCGT CGTGTAGATA 

9420 

CAATGATACC GCGAGACCCA 

9480 



CCGGAAGGGC CGAGCGCAGA 

9540 

ATTGTTGCCG GGAAGCTAGA 

9600 

CCATTGCTGC AGGCATCGTG 

9660 

GTTCCCAACG ATCAAGGCGA 

9720 



CCTTCGGTCC TCCGATCGTT 

9780 



TGGCAGCACT GCATAATTCT 

9840 



GTGAGTACTC AACCAAGTCA 

9900 

CGGCGTCAAC ACGGGATAAT 

9960 



GAAAACGTTC TTCGGGGCGA 

10020 



TGTAACCCAC TCGTGCACCC 

10080 



GGTGAGCAAA AACAGGAAGG 

10140 



GTTGAATACT CATACTCTTC 

10200 
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FIGURE 2K 



CTTTTTCAAT ATTATTGAAG CATTTATCAG GGTTATTGTC TCATGAGCGG ATACATATTT 

10260 

GAATGTATTT AGAAAAATAA ACAAATAGGG GTTCCGCGCA CATTTCCCCG AAAAGTGCCA 

10320 

CCTGACGTCT AAGAAACCAT TATTATCATG ACATTAACCT ATAAAAATAG GCGTATCACG 

10380 

AGGCCCTTTC GTCTTCAA 

10398 
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WO 96/13598 

TCGCGCGTTT CGGTGATGAC 
CAGCTTGTCT GTAAGCGGAT 
TTGGCGGGTG TCGGGGCTGG 
ACCATATGCG GTGTGAAATA 
ATTCGCCATT CAGGCTGCGC 
TACGCCAGCT GGCGAAAGGG 
TTTCCCAGTC ACGACGTTGT 
ACTCTAGAGG ATCCGAAAAA 
TGCAATTGTT GTTGTTAACT 
CATCACAAAT TTCACAAATA 
ACTCATCAAT GTATCTTATC 
AAGATGCAGT CATCCAAATC 
TTTCCCATGA TATGATGAAT 
GGTTGAGATT CTGACACGGG 
ATATTTGAAT TCTGATTCAT 
CCCACGTGAC GAGAACATTT 
TCTGACGTCG ATGGCTGCGC 
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GGTGAAAACC TCTGACACAT 

GCCGGGAGCA GACAAGCCCG 

CTTAACTATG CGGCATCAGA 

CCGCACAGAT GCGTAAGGAG 

AACTGTTGGG AAGGGCGATC 

GGATGTGCTG CAAGGCGATT 

AAAACGACGG CCAGTGCCAA 

ACCTCCCACA CCTCCCCCTG 

TGTTTATTGC AGCTTATAAT 

AAGCATTTTT TTCACTGCAT 

ATGTCTGGAT CCCCGCGGCC 

CACATTGACC AGATCGCAGG 

GTAGCACAGT TTCTGATACG 

AAAGCACTCT AAACAGTCTT 

TCTCTCGCAT TGTCTGCAGG 

GTTTTGGTAC CTGTCTGCGT 

AACTGACTCG CGCACCCGTT 
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GCAGCTCCCG GAGACGGTCA 
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TCAGGGCGCG TCAGCGGGTG 
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GCAGATTGTA CTGAGAGTGC 
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AAAATACCGC ATCAGGCGCC 
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GGTGCGGGCC TCTTCGCTAT 
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AAGTTGGGTA ACGCCAGGGT 
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GCTTGCATGC CTGCAGGTCG 
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AACCTGAAAC ATAAAATGAA 
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GGTTACAAAT AAAGCAATAG 
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TCTAGTTGTG GTTTGTCCAA 
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GCCAAATCAT TTATTGTTCA 
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CAGTGCAAGC GTCTGGCACC 
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CCTTTTTGAC GACAGAAACG 
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TCTGTCCGTG AGTGAAGCAG 
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GAAACAGCAT CAGATTCATG 
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AGTTGATCGA AGCTTCCGCG 
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TGGGCTCACT TATATCTGCG 
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FIGURE 5B 



TCACTGGGGG CGGGTCTTTT CTTGGCTCCA CCCTTTTTGA CGTAGAATTC ATGCTCCACC 
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TCAACCACGT GATCCTTTGC CCACCGGAAA AAGTCTTTGA CTTCCTGCTT GGTGACCTTC 
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CCAAAGTCAT GATCCAGACG GCGGGTGAGT TCAAATTTGA ACATCCGGTC TTGCAACGGC 
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TGCTGGTGTT CGAAGGTCGT TGAGTTCCCG TCAATCACGG CGCACATGTT GGTGTTGGAG 
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GTGACGATCA CGGGAGTCGG GTCTATCTGG GCCGAGGACT TGCATTTCTG GTCCACGCGC 
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ACCTTGCTTC CTCCGAGAAT GGCTTTGGCC GACTCCACGA CCTTGGCGGT CATCTTCCCC 
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TCCTCCCACC AGATCACCAT CTTGTCGACA CAGTCGTTGA AGGGAAAGTT CTCATTGGTC 
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CAGTTTACGC ACCCGTAGAA GGGCACAGTG TGGGCTATGG CCTCCGCGAT GTTGGTCTTC 
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CCGGTAGTTG CAGGCCCAAA CAGCCAGATG GTGTTCCTCT TGCCGAACTT TTTCGTGGCC 
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CATCCCAGAA AGACGGAAGC CGCATATTGG GGATCGTACC CGTTTAGTTC CAAAATTTTA 
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TAAATCCGAT TGCTGGAAAT GTCCTCCACG GGCTGCTGGC CCACCAGGTA GTCGGGGGCG 

1680 



GTTTTAGTCA GGCTCATAAT CTTTCCCGCA TTGTCCAAGG CAGCCTTGAT TTGGGACCGC 

1740 



GAGTTGGAGG CCGCATTGAA GGAGATGTAT GAGGCCTGGT CCTCCTGGAT CCACTGCTTC 
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TCCGAGGTAA TCCCCTTGTC CACGAGCCAC CCGACCAGCT CCATGTACCT GGCTGAAGTT 
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TTTGATCTGA TCACCGGCGC ATCAGAATTG GGATTCTGAT TCTCTTTGTT CTGCTCCTGC 
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GTCTGCGACA CGTGCGTCAG ATGCTGCGCC ACCAACCGTT TACGCTCCGT GAG ATT C AAA 
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CAGGCGCTTA AATACTGTTC CATATTAGTC CACGCCCACT GGAGCTCAGG CTGGGTTTTG 
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GGGAGCAAGT AATTGGGGAT GTAGCACTCA 
TTTCTGGTCT TTGTGACCGC GAACCAGTTT 
CTCTGAATCA GTTTTTCGCG AATCTGACTC 
CCGGTGGTTT CCACGAGCAC GTGCATGTGG 
AAGAAAAGGG CCTCCGGGGC CTTACTCACA 
AGCTTCTCGG CCACGGTCAG GGGTGCCTGC 
GGCGGCAACT CCCATTCCTT CTCGGCCACC 
AGATGCCCGT CAAGGTCGCT GGGGACCTTA 
GCTGCGCGTT CAAACCTCCC GCTTCAAAAT 
TACCCAGCGT GACCACATGG TGTCGCAAAA 
CAGGACTCTA GAGGATCCCC GGGTACCGAG 
TCCTGTGTGA AATTGTTATC CGCTCACAAT 
GTGTAAAGCC TGGGGTGCCT AATGAGTGAG 
GCCCGCTTTC CAGTCGGGAA ACCTGTCGTG 
GGGGAGAGGC GGTTTGCGTA TTGGGCGCTC 
CTCGGTCGTT CGGCTGCGGC GAGCGGTATC 
CACAGAATCA GGGGATAACG CAGGAAAGAA 



TCCACCACCT TGTTCCCGCC TCCGGCGCCA 
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GGCAAAGTCG GCTCGATCCC GCGGTAAATT 
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AGGAAACGTC CCAAAACCAT GGATTTCACC 

2220 

AAGTAGCTCT CTCCCTTCTC AAATTGCACA 

2280 

CGGCGCCATT CCGTCAGAAA GTCGCGCTGC 

2340 

TCAATCAGAT TCAGATCCAT GTCAGAATCT 

2400 

CAGTTCACAA AGCTGTCAGA AATGCCGGGC 

2460 

ATCACAATCT CGTAAAACCC CGGCATGGCG 

2520 

GGAGACCCTG CGTGCTCACT CGGGCTTAAA 

2580 

TGTCGCAAAA CACTCACGTG ACCTCTAATA 

2640 



CTCGAATTCG TAATCATGGT CATAGCTGTT 

2700 



TCCACACAAC ATACGAGCCG GAAGCATAAA 

2760 

CTAACTCACA TTAATTGCGT TGCGCTCACT 

2820 

CCAGCTGCAT TAATGAATCG GCCAACGCGC 

2880 



TTCCGCTTCC TCGCTCACTG ACTCGCTGCG 

2940 



AGCTCACTCA AAGGCGGTAA TACGGTTATC 

3000 



CATGTGAGCA AAAGGCCAGC AAAAGGCCAG 

3060 
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GAACCGTAAA AAGGCCGCGT 
TCACAAAAAT CGACGCTCAA 
GGCGTTTCCC CCTGGAAGCT 
ATACCTGTCC GCCTTTCTCC 
GTATCTCAGT TCGGTGTAGG 
TCAGCCCGAC CGCTGCGCCT 
CGACTTATCG CCACTGGCAG 
CGGTGCTACA GAGTTCTTGA 
TGGTATCTGC GCTCTGCTGA 
CGGCAAACAA ACCACCGCTG 
CAGAAAAAAA GGATCTCAAG 
GAACGAAAAC TCACGTTAAG 
GATCCTTTTA AATTAAAAAT 
GTCTGACAGT TACCAATGCT 
TTCATCCATA GTTGCCTGAC 
ATCTGGCCCC AGTGCTGCAA 
AGCAATAAAC CAGCCAGCCG 
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FIGURE 5D 

TGCTGGCGTT TTTCCATAGG 

GTCAGAGGTG GCGAAACCCG 

CCCTCGTGCG CTCTCCTGTT 

CTTCGGGAAG CGTGGCGCTT 

TCGTTCGCTC CAAGCTGGGC 

TATCCGGTAA CTATCGTCTT 

CAGCCACTGG TAACAGGATT 

AGTGGTGGCC TAACTACGGC 

AGCCAGTTAC CTTCGGAAAA 

GTAGCGGTGG TTtfTTTTGTT 

AAGATCCTTT GATCTTTTCT 
GGATTTTGGT CATGAGATTA 
GAAGTTTTAA ATCAATCTAA 
TAATCAGTGA GGCACCTATC 
TCCCCGTCGT GTAGATAACT 
TGATACCGCG AGACCCACGC 
GAAGGGCCGA GCGCAGAAGT 
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CTCCGCCCCC CTGACGAGCA 

3120 

ACAGGACTAT AAAGATACCA 

3180 

CCGACCCTGC CGCTTACCGG 

3240 

TCTCATAGCT CACGCTGTAG 

3300 

TGTGTGCACG AACCCCCCGT 

3360 

GAGTCCAACC CGGTAAGACA 

3420 



AGCAGAGCGA GGTATGTAGG 

3480 



TACACTAGAA GGACAGTATT 

3540 

AGAGTTGGTA GCTCTTGATC 

3600 

TGCAAGCAGC AGATTACGCG 

3660 

ACGGGGTCTG ACGCTCAGTG 

3720 

TCAAAAAGGA TCTTCACCTA 

3780 



AGTATATATG AGTAAACTTG 

3840 

TCAGCGATCT GTCTATTTCG 

3900 

ACGATACGGG AGGGCTTACC 

3960 

TCACCGGCTC CAGATTTATC 

4020 

GGTCCTGCAA CTTTATCCGC 

4080 
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CTCCATCCAG TCTATTAATT 
TTTGCGCAAC GTTGTTGCCA 
GGCTTCATTC AGCTCCGGTT 
CAAAAAAGCG GTTAGCTCCT 
GTTATCACTC ATGGTTATGG 
ATGCTTTTCT GTGACTGGTG 
ACCGAGTTGC TCTTGCCCGG 
AAAAGTGCTC ATCATTGGAA 
GTTGAGATCC AGTTCGATGT 
TTTCACCAGC GTTTCTGGGT 
AAGGGCGACA CGGAAATGTT 
TTATCAGGGT TATTGTCTCA 
AATAGGGGTT CCGCGCACAT 
TATCATGACA TTAACCTATA 
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FIGURE 5E 
GTTGCCGGGA AGCTAGAGTA 
TTGCTACAGG CATCGTGGTG 
CCCAACGATC AAGGCGAGTT 
TCGGTCCTCC GATCGTTGTC 
CAGCACTGCA TAATTCTCTT 
AGTACTCAAC CAAGTCATTC 
CGTCAATACG GGATAATACC 
AACGTTCTTC GGGGCGAAAA 
AACCCACTCG TGCACCCAAC 
GAGCAAAAAC AGGAAGGCAA 
GAATACTCAT ACTCTTCCTT 
TGAGCGGATA CATATTTGAA 
TTCCCCGAAA AGTGCCACCT 
AAAATAGGCG TATCACGAGG 
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AGTAGTTCGC CAGTTAATAG 

4140 

TCACGCTCGT CGTTTGGTAT 

4200 

ACATGATCCC CCATGTTGTG 

4260 

AGAAGTAAGT TGGCCGCAGT 

4320 

ACTGTCATGC CATCCGTAAG 

4380 

TGAGAATAGT GTATGCGGCG 

4440 

GCGCCACATA GCAGAACTTT 

4500 

CTCTCAAGGA TCTTACCGCT 

4560 

TGATCTTCAG CATCTTTTAC 

4620 

AATGCCGCAA AAAAGGGAAT 

4680 



TTTCAATATT ATTGAAGCAT 

4740 



TGTATTTAGA AAAATAAACA 

4800 

GACGTCTAAG AAACCATTAT 

4860 

CCCTTTCGTC 
4910 
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